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Abstract 

Rigid bodies, plastic impact, persistent contact. Coulomb friction, and massless limbs are ubiquitous simplifica¬ 
tions introduced to reduce the complexity of mechanics models despite the obvious physical inaccuracies that each 
incurs individually. In concert, it is well known that the interaction of such idealized approximations can lead to 
conflicting and even paradoxical results. As robotics modeling moves from the consideration of isolated behaviors 
to the analysis of tasks requiring their composition, a mathematically tractable framework for building models that 
combine these simple approximations yet achieve reliable results is overdue. In this paper we present a formal hy¬ 
brid dynamical system model that introduces suitably restricted compositions of these familiar abstractions with the 
guarantee of consistency analogous to global existence and uniqueness in classical dynamical systems. The hybrid 
system developed here provides a discontinuous but self-consistent approximation to the continuous (though possibly 
very stiff and fast) dynamics of a physical robot undergoing intermittent impacts. The modeling choices sacrifice 
some quantitative numerical efficiencies while maintaining qualitatively correct and analytically tractable results with 
consistency guarantees promoting their use in formal reasoning about mechanism, feedback control, and behavior 
design in robots that make and break contact with their environment. 


1 Introduction 

Simple models of complex robot-world interactions are key to understanding, implementing and generalizing behav¬ 
iors as well as identifying and composing their reusable constituents to generate new behaviors (Full and Koditsehek 
1999). There is strong appeal to using familiar physical simplifications such as rigid bodies, plastic impacts, persistent 
contact. Coulomb friction, and massless limbs in building up simple robotics models. Their coarse approximation 
to the underlying physical processes of interest are widely understood to offer the right combination of analytical 
tractability and physical realism in isolation. However, it is also widely understood that such individually useful sim¬ 
plifications can introduce catastrophic side-effects when combined (e.g. in Chatterjee (1999), Dupont and Yamajako 
(1994), Keller (1986), Mason and Wang (1988), Painleve (1895), Trinkle et al. (1997) and others, as discussed in 
Section 1.2). 

In this paper we assemble a framework of reasonable physical assumptions and accompanying mechanics to de¬ 
velop a formalism for combining them at will in the construction of a simple hybrid system model for contact robotics 
that yields a provably consistent* and empirically useful approximation to many behavioral settings of interest. As 
an example of the value of such mathematical models, new work (Brill et al. 2015) uses the formal properties of our 
self-manipulation model to develop rigorous correctness (or, non-existence) proofs for desirable robot behaviors - in 
that case, gap crossing and ledge mounting. However, while the primary goal of this paper is not numerical analysis, 
simulation does provide a useful way to visualize key features of the model and the utility of some of the simplifying 
assumptions. Numerical results obtained through a custom Mathematical simulation are used throughout the paper to 
illustrate key concepts, and to suggest the fidelity to physical settings of interest. 

For example, our model generates simulations^ of the leaping behavior depicted in Figure 1 that recreate the 

*Con'esponding author; Robotics Institute, Carnegie Mellon, Pittsburgh, PA, USA, e-mail: amjl@andrew.cmu.edu 

* Here, consistent refers to a combination of properties detailed in Section 3.4 analogous to the guarantee of global existence and uniqueness of 
solutions for a classical dynamical system. 

^Wolfram Mathematica 9, Numerical integration uses the NDSolve command, event detection uses the WhenEvent command, 
http : //WWW . wolfram.com/mathematica/ 

^For this simulation the middle and rear legs are used with a maximum current limit of 20A, a pseudo-impulse (defined in Section 2.7) magnitude 
of 8t — 0.03 (hand selected to give the qualitatively best overall results), relative leg timing of t 2 — 0.01 (i.e., the middle legs are started 0.01s before 
the rear legs), and once a leg has lifted off the ground it is slowly rotated upwai'ds out of the way. Remaining model pai'ameters are as listed in 
(Johnson and Koditsehek 2013a, Sec. Ill, Appendix G). 
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Figure 1: Keyframes from RHex simulation leaping onto a 20cm ledge. Blue arrows show contact forces while the 
red arrow shows body velocity. 


empirical results of Johnson and Koditschek (2013b) qualitatively (i.e., predicts the same salient features though not 
necessarily the same metric results), yet enjoys a combination of mathematical properties that we believe will provide 
a foundation for reasoning about and thereby generalizing the platform design and control strategies that gave rise to 
such behaviors. Of course, physical fidelity is not mathematically demonstrable and the relevance of the modeling 
choices we propose (i.e., the empirical sway of this formally self-consistent model) can only be established over the 
long run in practice by the breadth of physical phenomena they usefully approximate, regardless of the simplification 
and ease of analysis they afford. 

The paper is structured as follows. This section finishes with a summary of contributions, followed by a discussion 
of their relation to prior work. Section 2 introduces the various simplifying physical modeling assumptions and draws 
out some of the mathematical consequences bearing on their relationships to alternative formulations and to each other. 
Section 3 assembles from these pieces a formal hybrid dynamical system model and proves its consistency. Section 4 
reviews the scope of physical settings admitted by our assumptions and discusses the most delicate aspects of their 
interplay with our formal results, providing additional examples that help give a broader context for the applicability 
of the theory. Section 5 concludes with some final thoughts on the implications of this work and future directions. An 
extensive Appendix works through the details of selected proofs and provides additional background material. 

1.1 Contributions of the Paper 

This paper extends a framework for manipulation (Murray et al. 1994) and self-manipulation (Johnson and Koditschek 
2013a) modeling into a formal hybrid dynamical systems specification whose discrete modes are indexed by the active 
contact constraint set in a manner guaranteed to produce a unique execution from every initial condition under mild 
conditions on the motor feedback control laws. The foundation on which we rest this physically simple and math¬ 
ematically tractable modeling framework arises from Assumptions A1-A12, introduced in Section 2 and discussed 
further in Section 4.1, comprising various familiar phenomenological representations and physically natural hypothe¬ 
ses, including: rigid bodies (Al), massless limbs (A4), plastic impact (A8), and static friction (A12). It is known 
that in general these properties are not mutually consistent, however we formally demonstrate that the particular set 
of assumptions included here provides a well defined, deterministic, and computationally well-behaved model. The 
physical fidelity may, in some important applications contexts that we point out, necessarily remain something of a 
leap of faith relative to the still incomplete state of the theory of rigid body mechanics. To the best of our knowledge 
this is the first time any succinctly stated list of physical assumptions about rigid body mechanics has been shown to 
yield a consistent hybrid dynamical system with unique and globally defined executions. 

Our central technical contribution is the derivation of a consistent extension of Lagrangian dynamics, Newtonian 
impact laws, and complementarity contact conditions to systems that have certain rank deficiencies in their inertia 
tensor that agrees with (i.e., when rank is restored, maintains equivalence to) the nonsingular case (Lemmas 4, 5, (& 8 
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and Theorems 1 & 2). The possibly massless dynamics motivate a reformulation of complementarity as a logical 
equivalence (Lemma 9) so that its unique solvability (for both force-acceleration and impulse-velocity complemen¬ 
tarity problems, Assumptions A9 & AlO, respectively) can be shown to imply a unique partition of the guard set (i.e., 
those states which are to undergo a mode transition) into disjoint components labeled deterministically by the desti¬ 
nation mode of the transition (Theorem 5). These conditions are expressed in terms of a higher order scalar relation 
(^, Definition 1), and we exhibit certain properties of this relation that clarify its role in determining the guard set 
(Lemmas 1-3). 

Even without the introduction of massless limbs there exist many opportunities for repeated (and even Zeno‘S) 
discrete transitions that seem unlikely to add much physical insight (and, speaking practically, generally degrade the 
numerical performance of simulations based upon this model). Hence, to resolve the qualitative problem of spurious 
transitions at arbitrarily low velocities (Lemma 10), we introduce a new pseudo-impulse, which acts on the discrete 
transitional logic (rather than the continuous dynamics), imposing an implicit bound on contact velocity below which 
such contacts persist (Theorem 3), precluding certain Zeno phenomena (Theorem 11). 

As a structure to combine these physical models and assumptions, this work presents the formal definition of 
the self-manipulation hybrid system in Definition 5 (along with Definitions 2-4), and the formal demonstration of its 
consistency (including that it is deterministic and non-blocking. Theorems 4-9 and Lemmas 6 & 7), incorporating a 
well-behaved notion of completion in case of a Zeno execution (Definition 6, Theorem 10, Corollary 1) by adapting to 
this more elementary setting the measure theoretic arguments of Ballard (2000). 

1.2 Relation to Prior Literature 

This paper aims to promote simplified physics based models of robotic systems for purposes of analysis. Doing so 
entails integrating results and ideas that have developed somewhat independently across several different longstanding 
technical fields. For surveys of some of these ideas (with a focus on numerical considerations), see e.g. Brogliato et al. 
(2002), Gilardi and Sharf (2002). 

1.2.1 Numerical Simulation Methods 

While this paper is focused on a model for analysis and not simulation, it is informative to consider how other simula¬ 
tion strategies compare. The model developed here generates trajectories from the flow of hybrid dynamical systems 
defined by differential-algebraic equations (DAEs) between discrete transitions and so, in the language of (Brogliato 
et al. 2002, Sec. 6.3), simulations of these trajectories could be obtained^ via an event-driven scheme, as opposed to a 
penalized-constraint/continuous-contact scheme, or a time-stepping scheme. 

Event-driven schemes have a long history, e.g. (Wehage and Haug 1982), (Pfeiffer and Glocker 1996), (Brogliato 
et al. 2002, Sec. 6.7), and include the hybrid dynamical systems formulations outlined in the next section. Typically 
they entail alternating between integration of smooth dynamics involving (usually) finite forces from contacts and 
the discontinuous handling of constraint addition or deletion (the “events”). Here, we extend these methods and 
codify the event-driven scheme in terms of a formal hybrid dynamical system. In contrast, some event-driven schemes 
formulate the contact dynamics as always consisting of impulses, e.g. Mirtich and Canny (1995). These impulse- 
based simulations combine both smooth and discontinuous contact interactions into impulses, with a continuous-time 
ballistic trajectory in between events. 

Time-stepping schemes, which also account for contact interactions only using impulses by integrating applied 
forces over small time steps, are numerically efficient especially for systems with large numbers of constraints, see e.g. 
(Stewart and Trinkle 1996) (Anitescu and Potra 1997), or (Brogliato et al. 2002, Sec. 7). These models can be relaxed 
(by allowing contact forces and impulses to arise even before contact occurs) to enable efficient numerical simulation 
and motion synthesis (Drumwright and Shell 2011, Todorov et al. 2012). These methods allow contact constraints to 
be added or removed at any time step, but only once per time step. Furthermore, no distinction between continuous 
contact forces and discontinuous impulses is made. In this way these methods relax the requirements of the Principle 
of Constraints, i.e. that, “Constraints shall be maintained by forces, so long as this is possible; otherwise, and only 
otherwise, by impulses” (Kilmister and Reeve 1966, p. 79) (as noted e.g. in (Stewart and Trinkle 1996, Section 1)). 
Their advantage in avoiding many of the well explored physical paradoxes of rigid body mechanics (including Zeno 

execution of a hybrid dynamical system exhibits the Zeno phenomena if it undergoes an infinite number of discrete or logical switches in 
finite time (Definition 6). 

^E.g. using the algorithm proposed in Burden, Gonzalez, Vasudevan, Bajcsy and Sastiy (2015) for hybrid control systems. 
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phenomena (Drumwright 2010) as well as apparent contradictions between frictional forces and impulses discussed 
in Section 1.2.5) seems to come at the cost of persistence of contact. In contrast, here, persistence is one of the key 
simplifying modeling assumptions, expressing our intuitive experience of limbs interacting with the world, enabling 
some of the other assumptions, and affording our strong formal results. Despite being targeted at a different numerical 
integration scheme, many of the results in this paper, such as the consistent handling of massless limbs, are potentially 
applicable to time-stepping schemes. 

1.2.2 Hybrid Dynamical Systems 

This paper models manipulation and self-manipulation systems using a hybrid systems paradigm that assumes instan¬ 
taneous transitions. Though we develop our (so-called) self-manipulation hybrid dynamical system for a similar class 
of mechanical systems as that considered in (van der Schaft and Schumacher 1998, Ex. 3.3), we specialize from the 
more general class of hybrid automata considered in (Lygeros et al. 2003, Def. II. 1) to facilitate connections with 
the broader hybrid systems literature. Our self-manipulation system is closely related to the n-dimensional hybrid 
system of (Simic et al. 2005, Def. 2.1), the simple hybrid system of (Or and Ames 2011, Def. 1), and hybrid dynamical 
system of (Burden, Revzen and Sastry 2015, Def. 1) as we require: (i) multiple disjoint domains of varying dimen¬ 
sion, disallowed by Or and Ames (2011), Simic et al. (2005); (ii) guards with arbitrary codimension, disallowed by 
Burden, Revzen and Sastry (2015); and we desire (iii) more analytical and geometric structure than is provided by 
the general framework in Lygeros et al. (2003), specifically domains that are differentiable manifolds and guards that 
are sub-analytic. Note that (i) is precluded in Or and Ames (2011), Simic et al. (2005) only for notational expediency 
since any multitude of domains may be embedded as disjoint submanifolds of a high-dimensional Euclidean space. 
The condition (ii) is excluded by Burden, Revzen and Sastry (2015) since it is generally incompatible with the results 
contained therein. 

One property of hybrid systems that is crucial to establish for the present setting is that the guards are disjoint, 
i.e. no state is a member of two distinct guards, so there is no ambiguity as to which reset map to apply. This key 
property yields the proof that the model is deterministic (Lygeros et al. 2003, Def. III.2). Eurthermore the system is 
set up such that every point on the boundary of the domain where the flow points outward is a member of a guard, thus 
guaranteeing that the system is non-blocking (Lygeros et al. 2003, Def. III.l), i.e. the execution continues for infinite 
time. 

The self-manipulation hybrid system developed in this paper uses the active contact constraints to define the dis¬ 
crete state or status (that we will call the mode), as in e.g. Brogliato et al. (2002), Hurmuzlu and Marghitu (1994). 
However even when starting with a simple Lagrangian hybrid system without modes for every contact condition it 
appears to be useful to add such states to allow executions to be completed beyond a so-called “Zeno equilibrium” 
(Ames et al. 2006, Or and Ames 201 1). Eurthermore, the pseudo-impulse we introduce avoids certain Zeno executions 
by allowing the system to remain in a constrained mode after finitely many transitions, in a manner analogous to but 
formally distinct from the truncation proposed in Ames et al. (2006), Or and Ames (2011). 

1.2.3 Consistent Complementarity 

Any formulation that allows for persistent contact through impact must determine which contacts to make active and 
which to remove®. When there is no impulse (i.e., no constraint to add, but one or more constraints have violated 
the unilateral constraint cone), the removal process is called force-acceleration complementarity, as it is commonly 
modeled by a complementarity problem involving contact force and separating acceleration, e.g. (Trinkle et al. 1997, 
Eqn. 12), (Brogliato et al. 2002, Eqn. 10), where in the simplest case of a single contact point with zero or negative 
contact force it is simply removed. This complementarity problem framework can introduce paradoxical consequences 
in certain physical problem settings, for example in taking the rigid limit of a deformable body (Chatterjee 1999). It 
can also be computationally efficient to relax the hard constraints of the complementarity conditions, resulting in a 
convex optimization problem (Drumwright and Shell 2011, Todorov 201 1). 

An impulse induced from one or more contact constraints becoming active will generally necessitate the removal 
of other constraints, specifically, those that would require a negative impulse to remain. When invoked as a modeling 
principle, this impulse-velocity complementarity precludes a simultaneous impulse and separation velocity at a par¬ 
ticular contact, e.g. (Lotstedt 1982, Eqn. 2.10b), (Brogliato et al. 2002, Eqn. 9). Imposing this modeling discipline 

®The removal ends up being the harder question, as “there is no problem in deciding when and which constraint to add to the active set since 
there is a constraint function to base the decision on. The problem of dropping constraints is more delicate...” (Lotstedt 1982, p. 283). 
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affords the well established benefit of yielding a unique post-collision state for collisions modelled as plastic friction¬ 
less impacts (Cottle 1968, Heemels et al. 2000, Ingleton 1966, van der Schaft and Schumacher 1998). Unfortunately 
further generalizations can lead to inconsistencies and ambiguities (Chatterjee 1999, Hurmuzlu and Marghitu 1994, 
Ivanov 1995, Seghete and Murphey 2010). The existence and uniqueness of a solution must therefore be separately 
established in each physical circumstance that includes friction - or merely be assumed. 

Massless legs introduce new problems into the complementarity problem. The massless leg condition in general, as 
introduced in (Johnson and Koditschek 2013a, Assumption C.6) and also used in countless prior works, e.g. (Blickhan 
1989, Holmes et al. 2006, Kajita et al. 1992), allows for the neglect of certain states deemed inconsequential to the 
dynamics of interest when unconstrained (of course, the appropriateness of this neglect is task dependent rather than in 
any way intrinsic to the underlying physics, c.f. (Johnson and Koditschek 2013a, Sec. IV.C.5) or Balasubramanian et al. 
(2008)). Indeed a massless leg that is not touching the ground is unconstrained and its position can be taken as arbitrary 
(or regarded as evolving according to dynamics sufficiently decoupled as to be considered independent), as used in the 
behavior analysis in (Johnson and Koditschek 2013a, Sec. IV.C.3). However the complementarity condition as used 
in e.g. (Lotstedt 1982, Eqn. 2.10b), (Brogliato et al. 2002, Eqn. 9), and listed in (51)-(52) is ill-posed in the absence 
of mass since there is no well-defined separation velocity or acceleration, nor anything precluding all massless contact 
points from always separating (at least for the dynamic model of interest here, as opposed to a quasistatic model, 
(Trinkle and Zeng 1995)). Instead here we reformulate the complementarity condition as (46)-(47) to not depend on 
the separation velocity. 

1.2.4 Impact Mechanics 

The usual Newtonian impact law (as in e.g. (Chatterjee and Ruina 1998, Eqn. 3), (Eeatherstone 2008, Eqn. 11.65) and 
many others) can be thought of as a mass-orthogonal projection onto the constraint manifold as used in e.g. augmented 
Lagrangian techniques (Bayo and Ledesma 1996, Eqn. 25). More generally, Moreau (1985) showed that impact 
problems can be modeled using measure differential inclusions. The algebraic plastic impact law involves inversion 
of the inertia tensor, which precludes the possibility of massless limbs and necessitates the reformulation given in this 
paper. Even if there are no truly massless links, a nearly massless body segment yields a poorly-conditioned inertia 
tensor (Johnson 2014, Sec. 5.1.1), leading to similar formulations as the one presented here in (Westervelt et al. 2003, 
Eqn. 9) or, for continuous time dynamics, (Holmes et al. 2006, Sec. 4.3) (Eeatherstone 2008, Eqn. 3.17). 

In this paper we restrict our attention to systems modeled as exhibiting only perfectly plastic impact (perfectly 
inelastic impact). In the elastic impact case, it is necessary to consider the relative stiffness of contact points; depending 
on the restitution law invoked, multiple outcomes are consistent with the constitutive assumptions (Chatterjee and 
Ruina 1998, Hurmuzlu and Marghitu 1994). Though it is possible to bypass this technical obstacle by introducing an 
additional constitutive hypothesis, e.g. (Ballard 2000, H3 in Sec. 3.3), it remains to be validated (either theoretically or 
experimentally) that such assumptions accurately represent the physical system’s behavior. Plastic impact avoids these 
inconsistencies, but more importantly we claim plastic impact provides a more useful model of the robotic systems of 
interest. Elastic impact is clearly needed in some robotics applications such as juggling (Buehler et al. 1994, Schaal 
and Atkeson 1993), tapping (Huang and Mason 1998) or ping-pong (Andersson 1989), but plastic impact, where 
there is no restitution and therefore no separation velocity after impact, is a more desirable model for most forms 
of locomotion (when it is important to keep feet on the ground) (Chatterjee et al. 2002, Westervelt et al. 2003) and 
manipulation (when it is important to keep fingers on the object) (Chatterjee et al. 2002, Wang and Mason 1987). 

The new pseudo-impulse presented here, in addition to the Zeno results mentioned above, eliminates other ev¬ 
idently unwanted transitions by allowing the continuous-time forces to play a role in the impact process which is 
primarily “logical” (as opposed to energetic). This role may be best summarized by comparison to the most common 
alternatives. Eor example, instead of introducing a variable coefficient of restitution (Quinn 2005) (which our plastic 
impacts of interest already eliminate), the pseudo-impulse is not applied to the continuous (energetic) system directly 
but instead used to regularize the complementarity driven hybrid switching logic. Or as a second point of comparison, 
rather than introducing a fixed dead zone in impact energy (Pagilla and Yu 2001) or velocity (Brogliato et al. 2002, 
Sec. 6.4), the magnitude of the effect on our model’s hybrid logic is not fixed but rather scales with the continuous time 
forces. An effect similar to this pseudo-impulse condition is also introduced by time-stepping simulations (Anitescu 
and Potra 1997, Stewart and Trinkle 1996), which, true to their name, always consider forces over small but finite 
time-steps. Under such schemes the magnitude of this effect is not a fixed, independent, user-imposed parameter since 
it must remain proportional to the duration of each time-step. Our preference for the independent, fixed choice reflects 
both mathematical convenience (the clearly defined hybrid dynamical system with its formal properties) as well as our 
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Base constraint function (2.1) 

Velocity constraint function (2.1) 
Force constraint function (8) 

C differentiable function (2.1) 
Solution to the PRED predicate (32) 
Coriolis forces (12) 

Force-acceleration predicate (38) 
Contact constraints (2.1) 

Set of active contact constraints (2.1) 
Complementarity scope (33) 

Identity matrix, of dimension |Q| (2.1) 
Impulse-velocity predicate (50) 

Set of all contact constraints (2.1) 
Inertia tensor (2.3) 

Constrained inverse inertia tensor (8) 
Potential forces (e.g. gravity) (12) 

New touchdown predicate (22) 
Impulse in state space (2.5) 

Impulses in constraint space (25), (56) 
Pseudo-impulse IV predicate (58) 
Continuous state (2.1) 

Continuous state and velocity (2.1) 
Touchdown predicate (21) 

Unilateral constraint cone (2.1) 
Corresponding normal constraint (2) 
Small time duration of impact (56) 
Instantaneous change in velocity (2.5) 
Lagrange multipliers (13) 

Constrained contact inertia tensor (8) 
External forces and torques (12) 
Trending negative/positive (Def. 1) 


Table 1: Key symbols used throughout this paper, in addition to (Johnson and Koditschek 2013a, Table 1), with section 
or equation number of introduction marked. See also Table 2 for symbols introduced in Section 3. 


taste in preferring to work with robotics models targeted for specific physical environments and settings. 

1.2.5 The Effect of Friction Models 

While this paper focuses on the impact problem, which friction greatly complicates (Keller 1986, McGeer and Palmer 
1989, Trinkle et al. 1997, Wang and Mason 1987, Wang and Kumar 1994), even simulating continuous-time dynamics 
of rigid bodies with friction can be difficult (formally VP-hard (Baraff 1991)) due to the possibility of “jamming” 
events (Dupont and Yamajako 1994, Mason and Wang 1988), first attributed to Painleve (1895). In this paper, follow¬ 
ing the model from Johnson and Koditschek (2013a), strong assumptions about frictional contact avoid these issues 
and enable integration of the dynamics as a differential-algebraic equation (DAE). As noted above, an alternative 
method to numerically solving these problems is the time-stepping approaches pursued in, e.g., Anitescu and Potra 
(1997), Stewart and Trinkle (1996), which resolve these issues by allowing for impulses at any time step. To resolve 
these issues in more general extensions of the system presented here (in particular those that are not well modeled 
by the frictional assumption, A12), the hybrid dynamical system could similarly be extended by allowing impulses at 
times without collisions, with such jamming events considered with additional guards and reset maps. We refer the 
interested reader to “Is Painleve areal obstacle?” (Brogliato et al. 2002, Sec. 8.1) for further discussion of these issues. 
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2 Modeling Assumptions 


The continuous Lagrangian dynamics of self-manipulation is specified in Johnson and Koditschek (2013a) using the 
notation and terminology of Murray et al. (1994) and summarized in Section 2.1. We continue to work within that 
framework here and briefly list the subtle differences between these two classes of systems in Section 2.2. However 
the impulsive dynamics (instantaneous changes in velocity when a new contact is added) were not specified in either, 
and so we will introduce a plastic impact model in Section 2.5 and explore the induced complementarity conditions in 
Section 2.6. In addition, will make explicit how the massless leg (Section 2.3) and frictional assumptions (Section 2.8) 
made in Johnson and Koditschek (2013a) affect both the continuous time (Section 2.4) and impulsive dynamics, lead¬ 
ing to a new formulation for the dynamics that is equivalent to the usual formulation when there are no massless links. 
Finally, Section 2.7 introduces a new pseudo-impulse that eliminates certain Zeno executions and related chattering 
behavior. 

2.1 Setup and Notation 

The notation used in this paper is chosen to be consistent with (Johnson and Koditschek 2013a, Table I) (and agreeing 
where possible with Murray et al. (1994)) or is defined as it is used and summarized in Table 1. The base component 
of the state is denoted, qG Q, while the full state is, Tq := (q,q), and this state completely describes the motion of 
interest, as. 

Assumption Al (Rigid Bodies). The robot is made up of a finite number of rigid bodies whose configuration lies in a 
connected complete C® Riemannian manifold Q. 

Since the configuration spaces of many extant robots are not linear (due e.g. to rotary joints, rigid body rotations, 
or constrained mechanisms), it is most natural to invoke the general framework of differentiable manifolds to model 
the state space. For concreteness we will often consider the case where Q : = 0 x ^^(d) consists of joint angles and 
the special Euclidean group of dimension d, but our formal results will be stated for an arbitrary connected complete 
C® Riemannian manifold Q. We recognize that this generality necessitates mathematical formalisms and notation that 
are not uniformly adopted in the robotics community (exceptions such as Murray et al. (1994) notwithstanding); we 
aim whenever possible to translate unfamiliar objects into standard terminology and provide a terse overview of the 
background material needed to parse the more general case in Appendix C. 

We are concerned with sets of contact constraints (e.g., I,J,K C /C) that we shall call modes or contact modes 
hereinafter, subsets of indices whose particular elements (e.g., i,j,k G Kf index the contact constraints that prevail at 
some instant (Johnson and Koditschek 2013a, Sec. II.C) (Murray et al. 1994, Sec. 5.2.1). In addition to contact with 
the robot’s environment, contact constraints may include cases of self-contact as well as joint limits. The universe of 
all possible constraint indices from which these subsets are taken will be denoted K. = /C„ U/C(, partitioned by those 
that are in the normal (non-penetrating) direction and those that are in tangential (non-sliding) direction. Similarly, 
for any set of constraints specified by mode /, define the subsets /„ : = /n/C„ and 4 '■= IDICt, where clearly I = I„ U4 
and 4i n4 = 0. 

Contact constraints in the normal direction^, i G K-n, specify a holonomic constraint of the form {(q,q) G TQ: 
a,(q) = 0} for a, : Q —R (and whose corresponding velocity constraint Ai : T Q ^ M. is equivalent to the Jacobian 
Dai, (Johnson and Koditschek 2013a, Eqn. 11)), while those in the tangential direction, i G ICt, specify a nonholonomic 
constraint of the form {(q,q) G TQ: A,(q)q = 0} where again Ai: T Q ^ M.. Eor a given contact mode I, the space 
of constrained positions is a manifold C/ of dimension |/| (i.e., the number of constraints in I). 

In the interest of notational clarity, we will generally express functional dependence on contact modes via subscript, 
e.g., J£ 7 (q,...) := A(/,q, ...), and when it is clear from context, we will further suppress the subscript, e.g. A(q, ...). Eor 
example, and used extensively throughout this paper, fixing an ordering on /C we can obtain the velocity constraints 
active in mode I, Aj :TQ^ TCj, as a selection of rows from the set of all velocity constraints Ajc, i.e.,^ 

A/(q) := A(/,q) = 7r/AK;(q), (1) 

^Note that normal direction constraints for non-adhesive contact will be unilateral, although within a contact mode they can be considered 
bilateral until the constraint force is violated (e.g. (Lotstedt 1982, Sec. 4)). 

^However, note that most functions of the mode are not a simple projections, and so e.g. A|, defined in (8), aJ ^ rather aJ is as 

defined in (8), i.e. constructed with the corresponding Ay. 
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where tt/ is the Boolean projection matrix formed by the rows of canonical unit vectors associated with the elements 
in the index set I. Similarly for a single constraint i. A, := TtiA/c = A{,}. 

We make the following assumption on the combined maps, 

Assumption A2 (Simple Constraints). All constraints are independent, that is for all contact modes I, the maps 
a/„ : Q —^ C/„ and A[: TQ ^ TCi are constant rank. 

We refer the reader to Appendix C for the definition of rank of a C'^ map; in coordinates, this condition states that 
the gradient vectors of each coordinate of the respective maps are linearly independent at every point. If this condition 
failed to hold, the configuration space could possess singularities that could preclude existence and/or uniqueness of 
trajectories for the mechanical system. Note that this precludes the possibility of redundant constraints, though there 
are methods of resolving such redundancies, e.g. in Greenfield et al. (2005). In particular, this requirement will be met 
ifaK;„ G and A^ G C“(7’Q,RI^I) are constant rank®. 

We note that there is an assignment. 


a-.K^lCn, «k„=Icl, (2) 

of contacts to normal contacts such that a\ic, maps tangential contacts to the corresponding normal contact (where Id 
is the appropriate identity matrix). Note that for each k G /C, and j = a{k), Aj is orthogonal to Aj. 

It is well established that the motion of mutually constrained rigid bodies can be effectively modeled using poly¬ 
nomial maps (Wampler and Sommese 2013), hence imposing contact constraints arising from their interaction with 
the piecewise polynomial representations of the environment (commonly adopted by the sensory community (Lalonde 
et al. 2007)) leads to. 

Assumption A3 (Analytic Constraints). All constraints are analytic functions, that is for all contact modes I, the 
maps a/,, ; Q —>■ C/„ and Aj : TQ ^ TCi are C“. 

Given an analytic vector field subject to an analytic constraint, as shown in Lemma 3 it is possible to determine 
whether the constraint remains active over a nonzero time horizon by evaluating Lie derivatives at a single instant in 
time. If either the vector field or constraint were merely smooth, the differential equation determined by the vector 
field would, in general, need to be solved over a nonzero time horizon to determine whether the constraint remained 
active. 

Assumption A4 (Persistent Contact). Contact with the world occurs through a finite number of active constraints 
indexed by I <zlC that apply continuous time forces. Furthermore, contact persists until the next event (e.g. touchdown 
or liftoff). 

This assumption is related to the Principle of Constraints, as discussed in Section 1.2.1. Its adoption partitions tra¬ 
jectories so that at all times between instantaneous touchdown or liftoff events there persists a well-defined set of active 
constraints (enabling the systematic a priori enumeration and analysis of these constraint sets and their sequences, e.g. 
Johnson and Koditschek (2013b)). This contrasts with simulations generated by time-stepping algorithms, wherein 
contact Stewart and Trinkle (1996) or interpenetration Anitescu and Potra (1997) are resolved only at multiples of the 
timestep, and no distinction between forces and impulse are made (indeed this relaxation is what enables the efficient 
and consistent simulation in such formulations). 

The impact problem can be summarized as determining which constraints to add or remove from the active set. 
The active set continues to constrain the system so long as the unilateral constraint cone (Johnson and Koditschek 
2013a, Eqn. 7) is positive, U(/l) > 0, where X G T*C is the vector of Lagrange multipliers (constraint forces) (Johnson 
and Koditschek 2013a, Eqn. 33). Included in U is both the non-attachment condition that normal direction forces are 
positive as well as the friction cone that relates the magnitude of the normal and tangential components. 

In the complementarity problems, the following definition simplifies statements involving higher-order derivatives 
of the state that seem to arise unavoidably (as stated in (van der Schaft and Schumacher 1998, Sec. 3), (Heemels et al. 
2000, Sec. 1), formalizing the concepts represented in e.g. (Eeatherstone 2008, Eig. 11.4), (Siciliano and Khatib 2008, 
Sec. 27.2)), 

®This stronger assumption would not be true if there were two parallel constraints that, due to geometry, could not simultaneously be active, in 
which case the original requirement must be checked for all I. 





(b) 




(c) (d) 


Figure 2: Four examples of a planar point particle (Q = with a single constraint (/C = {1}), defined as (a) 
a = x^+4y, (b) a = —x^+4y, (c) a = + 8y, and (d) a = —x^ + 8y. Note that if the particle velocity is directed to the 

right (q = [v 0]^, v > 0), as illustrated, then; the constraint function is trending positive (a(q) 0) in (a) and (c); the 

constraint function is trending negative (a(q) ^ 0) in (b) and (d). 


Definition 1. Given a smooth function h : M defined over a smooth manifold M, a point x € M, and a smooth 
vector field F : M ^ TM, we say that h is trending negative with respect to the vector field F at x, denoted h{x) -<f 0, 
(or h{x) -< 0 if the context specifies F), if 

3m>0: {C’fh){x) < 0 A Vf < m : {Cph){x) = 0, (3) 

where Cfh : M —>■ R /s the mf^ Lie derivative^^ of h with respect to the vector field F. Similarly, we say that h is 
trending positive at x, denoted h{x) >■ 0, when —h{x) -< 0. We say that h is identically zero at x, denoted h{x) = 0, 
when V £gN : {Cph)(x) = 0. Finally, we say that h is trending non-negative at x, denoted h(x) A 0, when h(x) >-0 or 
h{x) = 0, and that h is trending non-positive at x, denoted h{x) < 0, when h{x) -<Q or h{x) = 0. 

We refer the reader to Appendix C for the definition of a vector field F \ M TM-, in the case where M = R", 
the tangent bundle TM can be canonically identified with R" to obtain a more familiar function F : R" —R" that 
determines an ordinary differential equation x = Fix). 

That is, h{x) ^ 0 if and only if the following vector, 

[/r(x), {CFh){x), {C\h){x), ...] , (4) 

is lexicographically smaller than zero (Bertsimas and Tsitsiklis 1997, Def. 3.5). As an example of when these 
properties are important, consider the examples in Figure 2. In each case, the initial configuration (taken as the bottom 
point of the circle) is q = [x y]^ = [0 0]^, and the initial velocity is q = [v 0] v>0. Assume that the particle has 

unit mass (M = Id 2 ), and that there are no non-contact forces (N = 0, C = 0, T = 0, as defined in Section 2.4). In all 
cases the particle is touching the constraint (a(q) = 0) but has no impacting or separating velocity (Aq = 0), so there 
is no impulse (as defined in Section 2.5). Furthermore in c) and d) there is no impacting or separating acceleration 
(Aq + Aq = 0). However in a) and c) the constraint function is trending positive, a(q) A 0, while in b) and d) the 
constraint function is trending negative a(q) ^ 0. 

Furthermore, we will make use of the following properties of this trending relation. 

Lemma 1. The closure of {x : h(x) -< 0} or {x : h(x) 0} is {x: h{x) < 0}, while the closure of {x : h(x) >- 0} or 
{x : h(x) ^ 0} is {x: h{x) > 0}. 

This is easy to see as {x: h{x) < 0} C {x: h{x) ^ 0} C {x : h{x) < 0} for any vector field. 

Lemma 2. Given a smooth vector field, F : M TM, a point in a smooth boundaryless manifold, x G M, and a 
smooth positive function, g : M ^ R+, any other smooth function, /i : M —^ R, is trending negative if and only if its 
product with g has the same property, i.e., 

h{x) ^F 0 six) ■ h{x) ^F 0- (5) 


***866 e.g. (Lee 2012, Ch. 9), and note our convention that, C^h = h. 
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See Appendix A for a proof. 

Lemma 3. Let h\ M be a C“ function and F : M ^ TM be a C“ vector field over a C“ boundaryless manifold 

M, and let % ; (—e, +e) M denote an integral curve for F through x := x(0). Then h is trending positive at x with 
respect to F, h(x) 0, if and only if there exists 5 € (0,e) such that, 

Vi G (0,+5) :/zo j(i) > 0. (6) 

The requirement that the manifold be boundaryless is introduced to simplify the statement of this Lemma; the Lemma 
clearly applies to the interior of a manifold with corners (which is, after all, simply a manifold without boundary) 
(Joyce 2012, Def. 2.1). 

To see that the lemma is true, note that if x is an integral curve for F such that h ox{s) is positive for i > 0 suffi¬ 
ciently small, then since h is analytic we conclude (3) is satisfied. The other direction follows easily by contradiction 
using the mean value Theorem. We note that this is not true if h or F are merely C“. Also note that the conditions of 
the lemma do not imply that V s G (—5,0) : h ox{s) < 0 for two reasons: 1) it is possible that h{x) f 0, and 2) even 
for h{x) = 0, grazing contact would handled incorrectly (consider the horizontal vector field in the plane, i = 1, and 
the function h{x,y) = —y at the origin). 

Lemma 3 implies a computationally efficient way to test these trending conditions is to simply integrate a flow 
until it reaches a zero crossing. 

2.2 Manipulation and Self-Manipulation 

This section will briefly summarize the self-manipulation formalism introduced in Johnson and Koditschek (2013a), 
as it relates to manipulation, e.g. as presented in Murray et al. (1994). Each defines a number of frames on the robot 
and its environment - the palm frame, the object frame, the contact frame, etc. In an effort to keep the problems as 
similar as possible, the following conventions were adopted in Johnson and Koditschek (2013a), 

• In self-manipulation, the robot is the object being manipulated and so to properly consider the forces and torques 
on the object the robot’s palm frame, P, and the object frame, O, are chosen to be coincident, (Johnson and 
Koditschek 2013a, Sec. II-B). 

• Thus motions that would, in a manipulation problem, move an object to the right will really move the robot to 
the left, and so the self-manipulation grasp map (a component of A) is a reflection of the manipulation grasp 
map, Gi := —G, (Johnson and Koditschek 2013a, Eqn. 15). 

• By convention the contact frame is defined at any point of contact with the z-axis pointing into the object (away 
from the Anger tip), (Murray et al. 1994, Sec. 5.2.1). In self-manipulation the convention of (Johnson and 
Koditschek 2013a, Sec. II-C) is to keep the contact frame consistent with respect to the legs, and so the z-axis 
points away from the robot and into the ground. This results in a unilateral constraint cone, U, that is negative, 
(Johnson and Koditschek 2013a, Eqn. 76, 78). 

• Since the palm reference frame is accelerating with respect to the world, the inertia tensor, M, (Johnson and 
Koditschek 2013a, Eqn. 26), and by extension the Coriolis terms, C, (Johnson and Koditschek 2013a, Eqn. 30), 
are more coupled and lack the block diagonal structure present in manipulation problems, (Murray et al. 1994, 
Eqn. 6.24). 

It should be no surprise that the problem formulations are structurally equivalent since the underlying kinematics 
and dynamics are indifferent to the problem category. However owing to the notational differences summarized 
above, through the remainder of this paper we choose to write out the problems in terms of a self-manipulation 
system, with the understanding that the results contained herein apply equally well to manipulation systems once 
these transformations are incorporated. 

2.3 Massless Considerations 

To properly define the dynamics of a partially massless system, consider a parametrized family of singular semi- 
Riemannian metrics, 

M£(q):Qx[0,£]^R^^^ (7) 
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such that Mo(q) := M is the (possibly) degenerate inertia tensor for the system (Johnson and Koditschek 2013a, 
Eqn. 26) and may be singular, while e assigns a small mass and inertia to any putatively massless links such that Mg (q) 
is full-rank for all e > 0 (for our present purposes, it is sufficient to use a limiting model such as Mg := Mq + eldq 
rather than some more specific physically motivated one). We invoke the general definition of Riemmanian metric 
here since it provides the coordinate-invariant formulation of the familiar mass or inertia matrix associated with a 
collection of rigid bodies, and refer the reader to Lee (1997) for a formal definition and Section 2.4 for additional 
details. The dynamics of the system in contact mode I can be expressed (as shown below) using the inverse of the 
following block matrix containing Mg, defining T*C, M^ :T*Q^T^Q, and A : T^C T*C as. 


mJ a|^ 

a] Aj 


:= lim 
£->0 



( lim 

■ M, 

Aj ' 


■ Mo 

Aj ■ 

\e->0 

A/ 

0 

) 

A/ 

0 


From this definition, note that the following properties hold. 


-1 


AU^ = AA'*'^ = Id, 

M^M-|-A^^A = Id, 


M^A^ = AM^ = 0, 

A^M-f-AA = 0. 


( 8 ) 

(9) 


( 10 ) 

( 11 ) 


To ensure that the inverse of the matrix in (9) (sometimes called the “Lagrangian matrix of coefficients”, e.g. 
(Papalambros and Wilde 2000, Eqn. 7.79), and sometimes used in robotics for numerical reasons, e.g. (Holmes et al. 
2006, Sec. 4.3)) is well-defined, we will require some modeling assumptions on the nature of the massless appendages. 
Thus if the inverse exists, this e-parametrized curve takes its image in GL{n) (the group of invertible matrices over 
R") within which matrix inversion is a continuous operation, hence the limit commutes with the inverse operation, and 
mI is a well defined smooth curve defined over all e S [0, e]. 

To meet this requirement, massless appendages will be allowed here only in a limited form. 


Assumption A5 (Constrained Massless Limbs). For all limbs in contact with the world, any rank deficiencies of the 
inertia tensor M (Johnson and Koditschek 2013a, Eqn. 26} are “corrected” by velocity constraints A sufficient to 
guarantee that any remaining allowed physical movement excites some associated kinetic energy, that is, the block 
matrix in (9) is invertible. 


If the “rank correction” condition in this assumption were violated, then it would not be possible in general to 
determine the system’s instantaneous acceleration solely from the internal, applied, and Coriolis forces; it could happen 
that either no accelerations are consistent with the net forces, or an infinite set of accelerations are. This condition 
admits its most physically straightforward expression via the requirement that the inertia tensor is nonsingular when 
written with respect to generalized or reduced coordinates, M (i.e., any local chart arising from an implicit function 
solution to the constraint equation (Johnson and Koditschek 2013a, Eqn. 10)). However, for purposes of this paper, 
we find it more useful to work with the Lagrange-d’Alembert formulation of the constrained dynamics, (Johnson and 
Koditschek 2013a, Eqn. 33), hence, we translate that natural assumption into more formal algebraic terms governing 
the relationship between the lifted (velocity) constraints, A (Johnson and Koditschek 2013 a, Eqn. 11), and the overall 
inertia tensor M as follows. 

Lemma 4. The matrix (9), is invertible if and only if the inertia tensor expressed in generalized or reduced 

coordinates, M (Johnson and Koditschek 2013a, Eqn. 36), is invertible (Johnson and Koditschek 2013a, Sec. ILK, 
Assumption A.4). 

as shown in Appendix B.l. See Section 4.2 for a discussion of physical scenarios that meet this requirement. 

When not constrained on the ground, any such massless links or limbs must then be removed from consideration as 
mechanical degrees-of-freedom: since they are massless, when unconstrained, the associated joints can be considered 
to have arbitrary configuration. Their evolution is instead treated according to the principle. 

Note that (Johnson and Koditschek 2013a, Eqn. 40) used the notation A* while in this paper we will use to signify the slight difference in 
definition used here, and to avoid confusion with the pullback of A, usually noted as A*, but which happens to be A^^. 
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Assumption A6 (Unconstrained Massless Limbs). For all limbs not in contact with the world, any components of the 
state that do not excite some kinetic energy must be removed from the usual dynamics and instead considered to evolve 
in isolation according to some independent, decoupled dynamics. 

In the same vein as the remark following Assumption A5 (Constrained Massless Limbs), we observe that it is not 
possible to uniquely determine accelerations of unconstrained massless limbs due to corresponding degeneracy in the 
inertia tensor. Excluding such limb states from the coupled Lagrangian mechanics governing the remaining body and 
limb segments will enable us in the sequel to specify a differential-algebraic equation that admits unique solutions. 
As the dynamics of the excluded states do not affect those of the remaining states, for the rest of this section we will 
abuse notation and suppress the subscript I from the state space Q, so that unless stated otherwise we are concerned 
with only the “active” component Qj of the decomposed state space for the mode of interest. See also Section 4.4 for 
a discussion of Zeno (Def. 6) considerations with massless legs. 

2.4 Continuous Dynamics 

With this notation, the continuous-time dynamics of (Johnson and Koditschek 2013a, Eqn. 33) in contact mode I can 
be expressed as. 


q/ := m] {Yj - Ciq - N/) - Af A/q, (12) 

I, a] {Y, - C/q - N/) - A/A/q, (13) 

where T/ is the applied forces, C/ is the centripetal and Coriolis forces, and N/ is the nonlinear and gravitational forces 
(Johnson and Koditschek 2013a, Eqn. 30, 31). 

When Mg, (7), is invertible (including, possibly, even for e = 0), it is easy to verify the equivalences (and dropping 
for now the subscripted contact mode /), 

M^ = M^'-Sr'A^(AM^‘A^)^'AM^\ (14) 

A^^ =M^'a^(AM^‘a^)-', (15) 

A = -(AM^'A^)-^ (16) 

as shown in Appendix B, (114). Note that constructions such as these are commonly used in robotics when M is 
invertible, e.g. (Khatib 1983, Eqns. 45^6) and many others (where their A^ has the opposite sign of our A and their / 
corresponds to A^^, although note that the dehnition ( 15) is exact and not dehned as a minimal-energy pseudo-inverse). 

Lemma 5. When Mq = M is invertible, the dynamics (12) and (13) are equivalent to the more common expression (as 
stated e.g. in the last equations of (Johnson and Koditschek 2013a, Appendix D), or (Murray et al. 1994, Eqn. 6.5, 6.6)), 

q = M^‘(Y-Cq-N-A^l), (17) 

A = (AM^‘a^)-' (a^‘ (T-Cq-N)-fAq) . (18) 

The claim follows directly from substituting (14)-(16), the explicit solution to (8) when M is invertible, into (12)- 
(13), as worked out in Appendix B.2. 

Whether M is invertible or not, we require. 

Assumption A7 (Lagrangian Dynamics). In each contact mode I, the time evolution of the active coordinates of the 
system are governed by Lagrangian dynamics, and the applied forces are such that the vector field defined by (12) 
for coordinates in Qj and^^ Fifor coordinates in Qi is forward complete, i.e. the maximal integral curve through any 
initial condition is defined for all positive time. 

That is, in contact mode /, the configuration manifold Q decomposes as a product of manifolds Q = Qi x Qi, where Qj corresponds to a 
subset of the system coordinates such that the matrix in (9) is nonsingular, and Q; corresponds to the remaining coordinates. The dynamics for the 
coordinates of Q/ is given by some vector field T/. Here we have written the dynamics as a second order vector field so that the dynamics of the full 
system may be written in a notationally consistent manner. This is not required; regardless of how the dynamics are defined for these coordinates, 
there will be no coupling of energy with the rest of the system through the inertia tensor. 
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Recall from the rigid body and unconstrained massless assumptions (A1 & A6) that the configuration space, Q, 
is a manifold without boundary. Thus the major obstacle to verifying Assumption A7 lies in preventing finite-time 
“escape” from the state space TQ, e.g. because the velocity grows without bound or there are “open edges” in the 
configuration manifold (i.e. the manifold is not compact). If the configuration manifold were compact, then it would 
suffice to impose a global bound on the magnitude of the vector field in (12). If the configuration space were instead 
Euclidean, Q = R", then it would suffice to impose a global Lipschitz continuity condition on the vector field in (12). 
We note that configuration obstacles such as joint limits or self-intersections are treated as constraints in Section 3, 
and hence pose no obstacle to satisfying the above boundarylessness and completeness conditions on the configuration 
space. 

However, since in examples of interest the configuration space is neither compact nor a vector space (due e.g. to 
rotary joints, rigid body rotations, or constrained mechanisms), we often require a more general condition. One such 
condition is obtained from (Ballard 2000, Thm. 10); since we rely on this sufficient condition elsewhere in the paper, 
we transcribe it explicitly into our notation as follows. When Q is a complete connected configuration manifold and 
M is a nondegenerate inertia tensor (i.e., at every q € Q the coordinate representation of M(q) is invertible, thus here 
precluding the possibility of massless limbs. Assumption A5), we let : Q x Q —R denote the distance metric 
induced by the Riemannian metric associated with M (Lee 1997, Ch. 6). For any vector q € TqQ we define 

IqIm •= Por covector f G T*Q we define := where f* G TqQ is the vector obtained by 

“raising an index” (in coordinates, f* = M *f^) (Lee 1997, Ch. 3). 

Lemma 6. If the ambient configuration space Q is a complete connected Riemannian manifold, M is a nondegenerate 
inertia tensor, and the magnitude ofYj — N/ grows at most linearly with velocity and distance from some (hence any) 
point in Q, i.e. if there exists C G R, qo G Q such that, 

V(q,q) €TQ: |T/(q,q) - N/(q,q)|j^-i < C(1-f Iql^ + %(qo,q)), (19) 

then the flow associated with the vector field (12) is forward complete, i.e. the maximal integral curve through any 
initial condition is defined for all positive time, and hence Assumption A7 is satisfied. 

Proof. This is simply an application of (Ballard 2000, Thm. 10) in the absence of unilateral constraints. □ 

We expect this condition to be met by any model based on a physical system, and is trivially met if there is a global 
bound on the magnitude of the applied, T/, and potential, N/, forces (whereas, notice, the necessarily unbounded 
Coriolis and centripetal forces are accounted for by the Lemma and require no further consideration). 

Unfortunately this condition assumes that the inertia tensor M is nondegenerate, precluding the presence of mass¬ 
less limbs (Assumption A5). Allowing instead for a degenerate inertia tensor but enforcing the unconstrained massless 
limb assumption (A6), we now describe a set of sufficient conditions that ensure Assumption A7 holds. 

Lemma 7. Suppose that in each contact mode I the active constraints are either holonomic or integrable ( Murray 
et al. 1994, Sec. 6.1.1), meaning that there exists a reduced configuration manifold Yi (i.e., generalized coordinates) 
such that every point in Qj lies in the image of an embedding h : ^ Qi that is invariant under (12) (Johnson and 

Koditschek 2013a, Sec. G) and restricted to which the reduced inertia tensor ( Johnson and Koditschek 2013a, Eqn. 36) 
is nondegenerate. 

If the hypotheses in Lemma 6 are satisfied for Yi, its reduced inertia tensor, and its reduced dynamics (Johnson and 
Koditschek 2013a, Eqn. 34), and furthermore the vector field Fj governing unconstrained massless limbs is forward 
complete and uncoupled from the massive or constrained coordinates, i.e., 

Tq/ = Dh{Yyj), q^ = Ffiqj), (20) 


then Assumption A7 is satisfied. 

Proof. We seek to define a forward-complete flow (j)j : x TQj ^ TQj consistent with the vector field in (12). 

Let h : yj ^ Qj denote the embedding associated with the reduced coordinates (Johnson and Koditschek 2013a, 
Sec. G). Apply Lemma 6 to the reduced system to obtain a forward-complete flow (j)j : [0,°°) x Tyj Pyj. Then 
since (Ballard 2000, Prop. 3) implies Dh maps integral curves from the reduced state space to the original, for all 
t G [0,oo) and Ty G Pyj, defining (j)(t,Dh(Ty)) = Dh o <j)(t,Ty) yields the desired forward-complete flow on Q/. □ 
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Lemmas 6 & 7 provide sufficient conditions guaranteeing that certain systems with either full rank inertia tensors or 
only holonomic constraints satisfy Assumption A7 - in the most general case, however, this will remain an assumption. 
We speculate that it is possible to derive a condition analogous to (19) using concepts from singular Riemannian 
geometry Hermann (1973) that ensure the existence of a forward-complete flow in the presence of nonintegrable 
constraints and a singular inertia tensor. 

2.5 Impulsive Dynamics 

Define the touchdown predicate, TD : /C„ x LQ B, where B := {True,False}, as, 

TD(k,Tq):=a,(q)=0AA,(q)q<0, (21) 

so that TD(A:,Tq) is true only at those points q where contact k should be considered for addition (in a manner to be 
qualified in Theorem 2 by the impulse-velocity complementarity condition, (50), defined below). Furthermore, define 
the new touchdown predicate. 


NTD(Tq) \/ TD(k,Tq), (22) 

keK„ 

such that NTD(Tq) is true only at those states where some new constraint is impacting. 

At impact into contact mode J, any incoming constraint velocity A/q must be eliminated. Here, we assume a 
Newtonian impact law, e.g. (Chatterjee and Ruina 1998, Eqn. 3) or (Featherstone 2008, Eqn. 11.65), that is. 

Assumption A8 (Plastic Impact). Impacts will be plastic (inelastic), occur instantaneously, and their effect described 
by an algebraic equation (23), defined below. 

In general, Aqy q^ — q^, the instantaneous change in velocity from q^ in contact mode I before impact to q^ 
in contact mode J after impact, is modeled as, Aqj = — (1 +e)Aj^Ayq^ (recall that A^^ :TC ^ TQ maps velocities 
in the contact frames to velocities of the system state). The coefficient of restitution, e, may be defined in any of the 
usual ways, however throughout this paper plastic impact (e = 0) is assumed. We restrict to plastic impacts as we 
believe it to be more relevant to most robotics applications, and since ambiguities arise when an elastic impact occurs 
in a system with multiple active constraints: different choices of impact model can yield distinct post-impact velocities 
(see Section 1.2.4). Eor plastic impacts, the post-impact velocity in mode J is, 

q+ = (Id - Af Ay)q- = M^Mq-, (23) 

where the final simplification follows from (11) and matches (Westervelt et al. 2003, Eqn. 9). The body impulse in 
configuration coordinates is, 

Py:=-M(q+-r)- (24) 

The contact impulse (i.e., the impulse at the contact points that induces the desired change in velocity to agree with 
the new contact mode J) is, 

Py : = aJP/ = aJma]^A yq- = -AyAyq- = A^Mq-, (25) 

where recall that Ay, A), M, and Ay are functions of the state q (which does not change during impact, i.e. q+ = q^), 
and the impulses, Py and Py, are also functions of the incoming velocity, q^. The final simplification arises from (11), 
matches (Westervelt et al. 2003, Eqn. 10), and will be used in Section 2.7. 

Lemma 8. When M is invertible, contact impulse (25) into contact mode J is equivalent to the non-degenerate plastic 
impact law, 

Py=(AyAr‘Aj)-'Ayq-, (26) 

as listed e.g. in ( Chatterjee and Ruina 1998, Eqn. 3). 

As with the proof of Lemma 5, the result may be seen by substituting (15) or (16), the explicit solution to (8) when 
M is invertible, into (25), as worked out in Appendix B.3. 
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2.6 Complementarity 

We now introduce the classical complementarity problems for forces and impulses at the contact points, and provide 
a reformulation that allows massless limbs. We begin with a general statement of the complementarity property (as 
in e.g. Cottle (1968), Ingleton (1966), Lotstedt (1982), van der Schaft and Schumacher (1998)), then subsequently 
specialize in Sections 2.6.1 and 2.6.2 to formulations of force-acceleration and impulse-velocity complementarity. 
Both versions have the general form of seeking real vectors y and z such that, 

y > 0, z > 0, y^z = 0, (27) 

(where for a vector y, y > 0 y, > OV i) subject to some problem-specific constraints. While the most general 
problem is uncoupled, that is y and z may be chosen arbitrarily so long as they satisfy (27), the cases we consider here 
are coupled by these problem-specific constraints (Pang et al. 1996, Sec. 3). In the linear complementarity problem 
(LCP), for instance, the coupling constraint has the form z := Ay-f c (e.g., (Brogliato et al. 2002, Eqn. 8)). The 
functional relationships between y and z for the complementarity problems in this paper will in general be nonlinear 
(as discussed in the rest of this section). Since the relation of interest will generally be problem-specific and index 
dependent in an essential way, we introduce temporarily an abstract scalar relation, > instead of ^ or > and similarly 
\> instead of or >, whose different instantiations will be prescribed in the force-acceleration and impulse-velocity 
versions of the problem, respectively. 

Solutions to this problem produce a natural bipartition on some index set, I, the scope (some subset of the 

universal scope /C, to be discussed below), where J = {j G I: zj = 0} and J^= X\J = {j gX : Zj \> 0}- Here, the 
role of y and z will be played by physically determined functions of a specified (“incoming”) state, Tq^ = (q^,q^), 
to yield an “outgoing” bipartition of the indexing scope, X. The indexing scope will be a function only of the 

incoming continuous state, X : TQ^ 2^, as defined in (33). 

It should now be clear that for this paper the complementarity problem is reduced to finding the unknown biparti¬ 
tion also known as the mode selection problem (van der Schaft and Schumacher 1998), as opposed to finding 

the values of the two complementarity vectors directly, e.g. (Cottle 1968). Namely, given an index set X, two functions 
y,Z : 2^ X TQ^ that map a subset J G 2^ into a Euclidian space with dimension equal to the size of the index 
set, and a generic relation > (to be instantiated as or > in the following sections), we require a solution to a set of 
constraint equations of the form, 

T,(y,Tq-)>0, Z,(y,Tq-)=0, ^ j GXdJ, (28) 

ijt(y,Tq-)=0, Z^(y,Tq-)>0, ykGX\J, (29) 

(where by definition, XDJ = J). Eor the complementarity problems of interest in this paper, the equality constraints 
in (28)-(29) will hold for all arguments (7,Tq^) g 2^ xTQhy construction (enforced, e.g., by the flow (12) in the 
force-acceleration version, and by the impact map (25) in the impulse-velocity version). 

The complementarity problem as stated thus far is not explicitly coupled (Pang et al. 1996, Sec. 3), i.e. it places 
no requirements on the relationship between f). and other than their common dependence on J and Tq^, which 
is why existence and uniqueness properties are challenging to define in general. Eurthermore, this necessitates the 
evaluation of both 1). and Z^. for constraints k that are not in J. With the possibly massless limbs in our setting, the 
evaluation of will not always be possible as the concept of a separation velocity or acceleration is poorly defined 
(once such a contact point has lifted off the ground the corresponding joints must be dropped from the state according 
to the unconstrained massless limb assumption, A6). Thus the specifics of Z^ in the problems considered in this 
paper necessitate an alternate formulation that takes advantage of the coupling between k). and Z^, as the inequality 
constraints have the property that. 


Zk{J,Tq-)l>0^Yk{JU{k},Tq-) p^O, (30) 

(the importance of the (yu{k}) mode was first noted in (Ingleton 1966, Eqn. 1.7.3)). This suggests the combined 
expression. 


(k G 7) (y):(7U{k},Tq )>0), VkGl, 


(31) 


which is equivalent to (28) & (29), 
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Lemma 9. The separate relational statements of the complementarity problem, (28)-(29), are equivalent to a single 
biconditional statement, (31), provided that the complementary functions Y and Z satisfy (30). 

Proof First note that for A: G 7 it is trivially true that 7 U {A:} =7 and so (3 1 ) simplifies to the first condition of (28). 
For kfiJ, the expression in (31) along with the substitution of (30) reduces to the second condition of (29). □ 

Expressing (31) as a predicate FRED : 2^ x FQ —^ B, 

PRED(7,Tq^) = (7CI) /\ G 7) (yi:(7U {/t},Tq^) ^ 0). 

kel 


We will denote by, 

CPpRED:7’Q^2'^:Tq-^7, (32) 

the implicit function that solves this set of constraints for the unknown required bipartition, where PRED varies with 
the particular instances as determined by the appropriate form of Y^. Note that while the codomain is 2^, the solution 
will always be a member of 2^. 

The existence of this implicit function (32) (i.e., the existence and uniqueness of a solution, 7, to the mode selection 
problem) will in the most general cases have to be an additional assumption*^ (see Assumption A9 and A10, below), 
although the specific complementarity problems in this section (i.e., based on the relationship of the specific functions 
Y and Z used in these cases) in the absence of friction reduce down to the conventional LCP problem and so existence 
and uniqueness has been proven in e.g. Cottle (1968), Ingleton (1966), Lotstedt (1982), van der Schaft and Schumacher 
(1998). 

The motivating literature and related work discussed in Section 1.2 generally imposes two complementarity con¬ 
ditions on rigid body dynamics models. The force-acceleration (EA) variant of (28)-(29), presented in (35)-(36), 
stipulates that there cannot be both a continuous time contact force and a separation acceleration at the same contact 
point, and is widely considered to arise from fundamental physical reasoning. In the present setting, EA comple¬ 
mentarity governs exclusively the nature of liftoff events (and extended in Section 2.8 to stick/slip events) wherein 
the number of active contacts (i.e., cardinality of the mode set) is reduced for reasons discussed in Section 2.6.1. In 
contrast, during instantaneous impact events the contact forces have no time to perform work. Instead, the impulse- 
velocity (IV) variant of (28)-(29), presented in (46)-(47), precludes a simultaneous impact-induced contact impulse 
and separation velocity at the same contact point. This constraint is known not to follow inevitably from the rational 
mechanics of rigid body models (Chatterjee 1999), but represents a commonly exploited algorithmic simplification 
that we will embrace in this inelastic model at the possible expense of consistency with elastic impact models in the 
limit. In the present setting, IV complementarity governs exclusively the nature of touchdown events wherein one or 
more new contacts become active (i.e., cardinality of the mode set is increased) for reasons discussed in Section 2.6.2. 

2.6.1 Force-Acceleration (FA) Complementarity 

For continuous time contact forces, when NTD(Tq) is false and therefore P = 0 when one or more contact constraints 
violate the unilateral constraint cone*^ U, some constraint will lift off and must be removed from the set of active 
constraints, resulting in a transition to a new mode. Determining that next mode sets up a complementarity problem 
over the existing contact mode / between the unilateral constraint cone, Ui:(A), if the contact is kept, and the separation 
acceleration ^A^q = A^q -f A^q if it is removed (recall that as an active constraint the state velocity is initially 
Aj(.q = 0). The full scope of contact constraints that should be considered is the set of all contacts which are “touching”, 
i.e. those whose normal direction component have zero contact distance and a non-separating velocity*^, 

I(Tq) :={iG/C:a„p)(q)=0AA„(,.)(q)q<0} (33) 

={/G/C: (a„(,)(q) = OAA„(,)(q)q = 0)vTD(a(0,Tq)}. (34) 

Recall that force-acceleration complementarity only holds when NTD(Tq) is false and so the final condition will apply 
here. Furthermore, while the full scope is formally required and does not depend on the active mode, numerically it 

^^However note that the remainder of this paper only requires a unique choice of a solution that satisfies the predicate if multiple solutions exist. 

Recall from Section 2.2 that U in the normal direction is — 1 according to the frame conventions of (Johnson and Koditschek 2013a, Eqn. 76,78). 

Note that thus fai‘ only normal direction constraints have been considered, however Section 2.8 will extend this to include tangential (sliding 
friction) constraints and this scope is defined in this general way in order to apply there as well. 
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suffices to check /Cl- this reduced scope eliminates the numerical challenge of checking the exact equality of (33). 
Any constraints that are not in I and therefore not algebraically guaranteed to satisfy this equality will, due to numerical 
error, only be close. This numerical approximation will miss cases such as Figure 2, in particular cases (a) and (d), 
wherein a constraint j that is not in the current active mode (j ^ I) satisfies the scope in (33) (j G I). In all of the 
cases in Figure 2, the point has zero contact distance (ay(q) = 0) and no relative velocity (Ayq = 0). However these 
examples are not generic as any perturbation in the state or constraint will resolve this problem. 

For transition into 7, consider contact force (13) both in J but also in the alternative mode 7U {k} where contact k 
is maintained (the reason for this alternative mode will become clear in Theorem 1), 


Uj(ly) ^ 0,A/q + A^q = 0, V; G In/, (35) 

U,(ly)=0,U,(lyuw)^0^ VkGiy, (36) 

where the identically zero constraints are guaranteed to hold in consequence of the dynamics governing mode J, 
namely, the invariance of the flow (A^q + A^q = 0 V y G 7 by (12)) and the Lagrange multipliers (U^(Aj) = 0 V k ^ 7 
by (13)). Note the importance here of the trending positive/negative conditions (Definition 1) - in general it is not 
sufficient to simply check the sign of the contact force but possibly higher derivatives as well. For example, in 
Figure 2, cases (c) and (d), assume the particle is sliding along the constraint from left to right. At the moment the 
particle reaches the origin, the contact force is zero. However in (c) the contact force is trending negative and the 
constraint should be removed, while in (d) the contact force is trending positive and it should be maintained. 

Constraints (35), (36) can be simplified into a form analogous to (30), hence, by Lemma 9, they may be further 
reduced to. 


(kG7)^(U,(lyuw)^0)’ (37) 

or as the predicate FA ; 2^ x TQ —>• B, 

FA(7,Tq-) = (7 Cl) /\ (k G 7) (U,(Ayuw) ^ 0), (38) 

kex 

for which set of constraints we will write the associated implicit function solution, following (32), as, CPFA(Tq^) = 7. 

This formulation of the force-acceleration complementarity problem is required to allow for massless limbs, for 
which a separation acceleration is not well defined. However when the separating acceleration is defined. 

Theorem 1. The non-penetrating acceleration condition at a contact k after liftoff into mode J, Aj-q + A<;q 0 
(when such an acceleration is well defined), is equivalent to a trending negative contact force 0 in mode 

K ;= 7U {k}. In other words, (35)-(36) are equivalent to the usual formulation (e.g., (Brogliato et al. 2002, Eqn. 10)), 

Uy(Ay) ^ 0,A/q + Ayq = 0, Vy G In7, (39) 

U^(Ay)=0,A^q + A^q^0, ykGl\J. (40) 


Proof Recall from Section 2.1 that Ki is the canonical projection onto the linear subspace spanned by the coordinate 
axes indexed by I, and assume without loss of generality that k is the final index in K such that. 


Aa: := ttatA/c 


■ Ay ■ 



A^. 




(41) 


The constraint cone, following (Johnson and Koditschek 2013a, Eqn. 78), applied to the contact forces, (13), is (see 
Appendix B for details, in particular (117) expanding A^ and Ak), 


\lk{lK) = -% (a; (Y - Cq - N) - AKkKOi 


AlM/ /_ - -X 1 r XT’ 

--^;t^(Y-Cq-N)--^^[ A,Af 


A^-MyAi; 


AtMtA 


J^k 


A^My (T - Cq - N) - A^Af Ayq + A^q 


a,m]a[ 



(42) 

(43) 

(44) 
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( 45 ) 


while the separating acceleration for constraint k in mode J is, using (12), 

A^qy+A,qy = A, (Mj(Y-Cq-N) -Af Ajq) +A,q = -(A,mJa[)U,(1^). 

Since the denominator in (44) is a positive scalar function of state (as shown in Appendix B, ( 11 8)), by Definition 1 and 

Lemma 2 a trending positive separation acceleration, A^q + A^.q implies a trending negative contact force, 

and vice-versa, and therefore (35)-(36) are equivalent conditions to (39)-(40). □ 

Furthermore, with or without a full rank inertia tensor, we assume the existence of a unique solution to the force- 
acceleration complementarity problem. 

Assumption A9 (Force-Acceleration Complementarity). The force-acceleration complementarity constraints, FA, 
(38), always admit an implicit function, CPfa> over the entire state space T Q, even under the frictional properties that 
follow Assumption All. That solution correctly captures the behavior of the physical system. 

While there has been a long line of literature (e.g., (van der Schaft and Schumacher 1998, Ex. 3.3)^®) that proves 
that this is always true for independent, plastic, frictionless contacts, no result has been found that covers the limited 
frictional conditions introduced in 2.8. We impose this condition in support of the Theorems in Section 3. 

2.6.2 Impulse-Velocity (IV) Complementarity 

Impact at one contact location can cause another contact to break, as the contact impulse must obey the unilateral 
constraint cone, Uj(P 7 ) > 0 V j G J, i.e. both that the impulse in the normal direction be positive (non-adhesive) and 
that the tangential impulse lie in the friction cone (Chatterjee and Ruina 1998). Any contact point that would have 
violated that requirement must be dropped from the active contact mode. 

In addition the post-impact velocity must not allow the removed contact point to leave with a penetrating velocity 
(i.e., the impulse cannot result in a velocity “into” the surface). However in the case of massless legs a positive 
separation velocity is always achievable. As an alternative requirement that is based only on impulsesconsider 
the contact impulse (25), Py (associated with the passage from contact I to contact J), but also the contact impulse 
Pyu{A:} (associated with the passage from contact I to alternative mode 7 U {k} where contact k is maintained). These 
impulses, along with the post-impact velocity (23), A/q+, must satisfy, 

U;(Py)>0,Ayq+=0, Vjelny, (46) 

U,(Py) = 0,U,(Pyuw) <0, Vk G T\J, (47) 

where the scope, I, is again formally the set of all touching constraints, (33). However in numerical simulation it is 
sufficient to check only the active constraints as well as those touching down*^, (21), 

Iiv :=/U{/G/C\/:TD(a(0,Tq)} Cl. (48) 

Note that the equality constraints are enforced by the impact law (25), and so, by Lemma 9, these constraints may be 
summarized as, 

(kG7)4^(U,(Pyuw)>0), VkGX. (49) 

or as the predicate IV : 2^ x TQ —^ B, 

IV(7,Tq-) = (7 C X) /\ (k G 7) O (U,(Pyuw) > 0), (50) 

kel 

whose solution, following (32), will be denoted, CPiv(Tq^) = 7. 

As with the force-acceleration complementarity problem, this formulation of the impulse-velocity complementar¬ 
ity problem is required to allow for massless limbs, for which a separating velocity is not well defined. However when 
the separating velocity is defined, 

In the language of that paper, this is a Dynamic Complementarity Problem (DCP), and note that (van der Schaft and Schumacher 1998, 
Eqn. 33) asserts complementarity with the base constraint (which they call y = C[q)), but here the position and velocity are already zero, i.e. 
V y € /,ay (q) = Aj{q)q — 0, and so the acceleration is the first degree that must be checked. 

Note that this formulation based only on impulses also simplifies the inclusion of the pseudo-impulse condition (58). 

Note that this excludes those constraints which are touching but separating, but whose post-impact velocity will be penetrating. For such cases 
the application of this first impact will put the state into the guard for those constraints, and thus even in this pathological case the execution will 
continue from the correct mode. 
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Theorem 2. The non-penetrating velocity condition at a contact k after impact into mode J, Ai.q+ > 0 (where such a 
velocity is well defined), is equivalent to a negative contact impulse, U;t(PA:) < 0. impact into mode K := J U 
In other words, (46)-(47) are equivalent to the usual formulation (e.g., (Brogliato et al. 2002, Eqn. 9)), 


U;(Py)>o,Ayq+ = o, vjeiny, (5i) 

U^(Pj) = 0, Aiq+ >0, V ^ e T\J. (52) 


Proof Recall from Section 2.1 that nj is the canonical projection onto the linear subspace spanned by the coordinate 
axes indexed by /, and assume without loss of generality that k is the final index in K such that Ak is defined as in (41). 
The constraint cone, following (Johnson and Koditschek 2013a, Eqn. 78), applied to the contact impulse, (25), is (see 
Appendix B for details, in particular (117) expanding Ak), 


Ua:(P^:) = nkAKAKq = — _ [ A^aY 1 

a,m;a[ 



AkAYAjq -A^q 

AiMWk 


(53) 


while the separating velocity for constraint k after impact into mode J is, using (23), 

Ai.q+ = Ai-q^ -A<-A]^Ayq^ = - (^Ai.MjA[) U<;(P/f). 


(54) 


Since the denominator in (53) is a positive scalar function of state (as shown in Appendix B, (118)), a positive separa¬ 
tion velocity, Ai.q+ implies a negative contact impulse, Ui(PA:), and vice-versa, and therefore (46)-(47) are equivalent 
conditions to (51 )-(52). □ 


Furthermore, with or without a full rank inertia tensor, we assume the existence of a unique solution to the impulse- 
velocity complementarity problem. 

Assumption AlO (Impulse-Velocity Complementarity). The impulse-velocity complementarity constraint, IV, (50) 
always admit an implicit function, CPiv, over the entire state space T Q, as does the modified problem including the 
pseudo-impulse, PIV, (58), introduced in Section 2.7, and under the frictional properties that follow Assumption A12. 
That solution correctly captures the behavior of the physical system. 

As with the FA complementarity problem, there has been a long line of literature (e.g., (Fotstedt 1982, Eqn. 2.10b)) 
that proves that this is always true for independent, plastic, frictionless contacts, however no result has been found that 
covers the pseudo-impulse introduced in Section 2.7 or the limited frictional conditions introduced in Section 2.8. We 
impose this condition in support of the Theorems in Section 3. 


2.7 Pseudo-Impulse 

Impulses arising from impacts (both plastic and elastic) generally break existing contacts. For example an impulse 
imparted to the underside of a rigid block resting on the ground in a gravitational field must cause it to leave the 
ground for a small time interval no matter how weak the impulse or how massive the block. In truth the block is not 
rigid and the impulse is temporally distributed; modeling the resulting subtle deflections would greatly complicate the 
system. It appears expedient to impose a minimum threshold on impulse magnitude below which the system may be 
considered quasi-static and the block remains on the ground, but above which the system is dynamic and the block 
detaches from the substrate. This threshold could be specified directly in terms of a pre-deflned limit on the system 
velocity or impulse magnitudes, however here we introduce a single parameter that allows the cutoff to scale with 
respect to the other problem variables such as velocity and the contact configuration. We show in Section 2.7.1 that 
this induces an implicit bound on velocity. 

In this section we define an additional impulse during impact which qualitatively improves results and eliminates 
some Zeno phenomena. This impulse may be thought of as a tuning parameter and while we give some physical moti¬ 
vation for its magnitude the inclusion of this term is motivated primarily by improving the qualitative behavior of the 
numerical simulation, e.g. by excluding chattering and Zeno phenomena, yet doing so in a manner that retains physical 
fidelity across a broad range of application settings while preserving mathematical consistency. See Section 4.3 for 
example models of physical situations whose physical fidelity and mathematical properties both appear to be enhanced 
by the introduction of such a pseudo-impulse. 

Therefore we will make the following new assumption about the physics of the system. 


19 






Assumption All (Pseudo-Impulse). The continuous time forces apply some small amount of work during the impact 
process. 

In the usual Newtonian impact model, these forces have no effect (as shown, e.g., in (ten Dam and Willems 1997, 
Prop. 4.3)), however here we add an additional pesudo-impulse to support this assumption. 

This assumption is not used in this paper to directly change the energy at any state, but rather is used within the 
discrete switching logic to improve the quality of make-break contact decisions (which implicitly changes the resulting 
kinetic energy of the system). Specifically, consider the pseudo-impulse, P G T*C, that the contacts would impart on 
the system to resist the continuous time forces for some small time duration, 5, G during impact into mode J, 


M5q;=lim f Mqdt ^ (Y - Cq^ -N)5,, 

S [—^0 J 

(55) 

P ; =A]:M5q = a] (Y - Cq^ - N) 5,. 

(56) 


This small time 5i can be regarded as the finite duration of the (actually non-instantaneous) impact process (Quinn 
2005). An alternative interpretation is that 5t specifies the time duration after an impulsive separation of a contact 
during which if that contact were to return to the active set, the qualitative behavior of the system would be improved 
by never considering it as having left. This interpretation, correct to first order for single contact systems, is useful 
when considering what value of 5, should be used. In the simulations shown in this paper a magnitude of 5, = 0.03s 
has been found to give the best results, although this value is surely dependent on the material properties, the system 
velocities, and the desired qualitative and quantitative behavior of the model. 

This pesudo-impulse is not directly applied to the system (as in Quinn (2005)), because in this model impacts 
occur instantaneously and the velocity displacement 5q would not be uniquely determined by (56) when M is singular. 
Instead the pseudo-impulse is added to the plastic impact impulse, (25), and in that way can be considered as a modified 
incoming momentum, 

P;r + Ptr = -A^rA^^q + A^MSq = A^ (Mq + M5q), (57) 

This modified momentum is used as an extra guard condition during impact, U(P + P) > 0, in addition to the usual 
condition, U(P) > 0, since the pseudo-impulse should not break contacts that would otherwise persist. That is, when 
5; > 0 the IV complementarity problem (50) is replaced by the predicate, 

PIV(y,Tq) := (CPiv(Tq) C 7 C I) /\ (k G 7) (Ui(Pjr) > 0VU^(Pjf TP^f) > 0) (58) 

kel 

whose solution, following (32), will be denoted, CPpiv(Tq^) = 7. Note that by construction, 

V Tq G TQ : CP,v(Tq) C CPp,v(Tq). (59) 

The formulation of the complementarity condition based only on impulses in (46)-(47) is key to admitting this mod¬ 
ification, as using instead (51)-(52) would require considering both the impulsive and velocity implications of this 
pseudo-impulse. 

The existence and uniqueness of a solution to (58) will, for the purposes of this paper, simply be assumed under 
Assumption AlO. In numerical studies, we did not encounter simulations that violated this property. However, we 
have not derived sufficient conditions ensuring the property holds for this modified complementarity predicate. 

2.7.1 Velocity Implications of the Pseudo-Impulse 

The IV predicate is provided as a purely logical proposition. However, its truth value varies in physically-interpretable 
ways with respect to variations in the base point at which it is evaluated. In the following. Lemma 10 shows that 
CPiv returns the same answer regardless of the impact speed along any particular impacting velocity direction, and 
Theorem 3 shows that CPpiv does not. See Section 4.3 for a discussion of physical implications of these results in 
simple example settings. 

Lemma 10. Let I <Z}C, (q,q) G TQ be such that there exists a unique k G /C\/ such that ajt(q) = 0 and A|t(q)q < 0, 
i.e. the system will instantaneously undergo impact with exactly one constraint k € K,. Define K ;= /U {k} and, 

V i > 0 : q, := (w - (1 - ^)A^^A^) q. (60) 
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If 5t = 0, then J := CPiv(q,q) C K, satisfies, 


Vi>0:y = CPiv(q,qi), (61) 

that is the solution to the complementarity problem is the same at any impacting speed. 

Proof. The impulse-velocity complementarity predicate (50) is a conjunction of propositions involving conic inequal¬ 
ities; since furthermore the contact impulse (25) is simply scaled (this first identity can be seen using the expansions 
given in Appendix B, (117), and see also (10)), 



AjAI^Ak = Aj [Af + M^... - m]...] Ajf = (Id + 0)Ay + 0 = Ay, 

(62) 


Pi(q,4s) = -AyAy(Id- (1 -s)A^^AA:)q = s(-AyAyq) = sPy(q,q), 

(63) 

we have. 

V; £ 7 : U, (Py(q,q,s)) = U,' (^Py(q,q)) . 

(64) 

Therefore 

V7£7:U,(Py(q,q,)) > 0 U, (Py(q,q)) >0; 

(65) 

the conclusion of the Lemma follows directly. 

□ 


Theorem 3. Assume the same conditions as in Lemma 10. If further there is a unique constraint i £ I\J such that, 

U< (Aju{,}(q)(^(q>qo)-C(q,qo)-N(q,qo))) >0, (66) 


(i.e., a constraint i £ I is impinged upon by external forces but would be removed after impact with constraint k if 
5i =0) then for all values of the pseudo-impulse parameter 5t > 0, there exists s > 0 such that with 4v defined as 
in (60) we have. 

Vs £ (0,s): i £ CPpiv(q,qs): (67) 

that is, the pseudo-impulse will prevent the liftoff of constraint ifor all sufficiently small impacting speeds. 

Proof. For all s > 0 let K{s) := CPpiv(q,qs)- Recall from (59) that CPiv(q,qs) C CPpiv(q,qs) for all s > 0. Thus 
although 7if(s) may not be constant, we are only concerned with the asymptotic inclusion of additional constraints. 
Note that, 

lim P/f(,)(q,q.s) =0, (68) 

and therefore for all values of the pseudo-impulse parameter ^ > 0, 

lim u^(P/f(s)(q,q.v)+P/f(s)(q,qJ) = Vk{^K{s){(iAs))- (69) 

5 '—> 0 + 

for all k £ K{s). By assumption, constraint i is the only constraint such that U,(P) > 0 and, by (58), is to be included 
in the solution. As no other constraints are added or removed, there must exist some s > 0 that ensures (67) holds. □ 

While Theorem 3 is limited to only single constraints, the pseudo-impulse parameter can similarly prevent the 
impulsive liftoff of at least some constraints when there are multiple that satisfy (66), subject to the nature of the PIV 
complementarity problem. 

2.7.2 Pseudo-Impulse Example 

The pseudo-impulse can help to resolve certain Zeno executions, as shown in Section 3.5.2, but more importantly 
it reduces “chattering”, or executions that involve many impulsive transitions that are qualitatively undesirable. An 
example can be seen in the RHex leaping simulation of Figure 1 - see also the additional examples explored in 

Section 4.3. Figure 3 compares the state just before and after the rear leg touches down with and without the pseudo¬ 

impulse term. At that instant the calculated impulse (25) is Pi = — 1.47Ns (in the normal direction on the front leg). 
Even though the leg motor is applying maximum torque trying to keep the leg on the ground the small negative impulse 
will cause the leg to separate, and then the motor torque will quickly accelerate the leg back to the ground (with or 
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Figure 3: Keyframes around the impact of the second leg with the ground - note the difference in contact forces 
(blue arrows) which indicate which contacts are active. Top Row: Without pseudo-impulse (5, = 0). Bottom Row: 
With pseudo-impulse (5; = 0.03). The center top frame shows that the contact with the front leg is lost when the rear 
leg touches down (and therefore no contact force is possible), but the center bottom frame shows both contacts are 
maintained with the pseudo-impulse (and therefore both contacts can apply forces to the system). Note that there is a 
slight difference in touchdown time due to similar discrepancies around the time of the first leg touchdown. See also 
Figure 4. 



Figure 4: Comparison of the front leg normal direction ground reaction force for evaluations with and without the 
pseudo-impulse. See also Figure 3. 
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without massless legs, as recall that even massless legs are assumed to have finite acceleration, thus the leg may return 
quickly but not instantly). With the pseudo-impulse this is balanced out by Pg i = 7.91Ns, and the leg does not leave 
the ground (as would be the case on the real robot in this configuration to within modeling precision). If the induced 
impulse were much larger then the desired result may be for the front leg to lift off the ground, while a much smaller 
impulse would clearly not break the front leg’s contact. The 5t term is in essence a tuning parameter that determines 
the threshold between a quasi-static regime (where contacts are maintained) and a dynamic regime (where impulses 
may break existing contacts). 

Impulsively breaking contact at the wrong time is an even bigger problem when considering a full behavior and not 
just analyzing an individual impact event. As Figure 4 suggests, without a pseudo-impulse this impulsive liftoff can 
lead to chattering. In this case starting around t = 0.023 the front leg lifts off but the continuous time forces return the 
leg to the ground after a short time. When the front leg impacts the ground, the rear leg then impulsively breaks contact, 
and a cyclic oscillation begins. This behavior is not quite a Zeno-execution, as the finite acceleration of the leg in the 
air results in only finitely many transitions in a finite time, however these transitions are still qualitatively undesirable. 

2.8 Friction 

While this paper is not focused on methods for modeling friction, including friction in some form is unavoidable 
(McGeer and Palmer 1989). Here, in order to advance our targeted conclusions guaranteeing the existence and unique¬ 
ness of a solution, we will assume that. 

Assumption A12 (Friction). All contact points with Coulomb friction are attached only to massless links. Contact 
points without friction are assumed to never resist sliding motion, and all contact points that are sliding have no kinetic 
coefficient of friction. No sliding-to-sticking transitions are considered. 

The velocity constraint. A, unilateral constraint cone, U, and complementarity problems, CP, are thus taken to 
include any active frictional constraints - see Section 2.1 or (Johnson and Koditschek 2013a, Assumptions C.3 and C.4) 
for details. 

This restrictive frictional assumption ensures that during impact (i.e., in evaluating CPjv, the impulse-velocity 
complementarity problem (50)) any conflict involving the frictional constraints can be resolved by simply removing 
that contact (the normal and tangential constraints) from the active set (see Section 1.2.5 for a summary of pathologies 
that arise when this assumption is relaxed). As a massless link, it can always rotate out of the way fast enough (as 
discussed above in Section 2.6). 

In continuous time (i.e., in evaluating CPfa. the force-acceleration complementarity problem (38)) this frictional 
assumption as applied to the RHex model states that the body has a low coefficient of friction and does not resist 
tangential forces while the legs’ rubber feet have a high coefficient of friction and therefore always do. This a priori 
assumption about friction is certainly not a good model for every situation - consider what happens when RHex’s 
legs push against each other, as with the vertical leap described in Johnson and Koditschek (2013b) (see in particular 
Footnote 8 and the end of Section V.A). In order to model such a behavior the leg contact points must be allowed 
to transition to sliding contact when the contact forces reach the friction cone in the tangential direction, Ui(A) > 0 
(Johnson and Koditschek 2013a, Eqn. 4), as with the liftoff condition in (35)-(36). Indeed it appears that allowing 
such sliding modes is required to for a solution to the frictional force-acceleration problem to exist in general (McGeer 
and Palmer 1989). 

After transition to sliding the kinetic coefficient of friction is taken as jif; = 0 (as with the frictionless body contact 
points) so that the jamming problems discussed in Section 1.2.5 are again avoided. Adding this contact mode enables 
for example the simulation of the vertical leap shown in Figure 5 or the leap onto a ledge shown in Figure 10, which 
each use a nominal value of jJ. = 0.8. This finite friction coefficient is less critical for the forward leap of Figure 1, 
though still used in this case with the relatively large but finite values of 1.8 for the front leg and 2.5 for the rear. 

The transition from sliding to sticking is much trickier. A sliding constraint will stick when the tangential velocity 
drops to zero and the resulting contact forces lie within the friction cone, i.e. contact k € /C, is to be added if and only 
if its corresponding normal constraint is active (a(k) £ 1) and. 


Ai-q = 0 A Ui:(A/u{<:}) ^0. (70) 

However this additional condition complicates the force-acceleration complementarity problem and furthermore is not 
needed to model any of the leaping behaviors in Johnson and Koditschek (2013b), our motivating scenario. Therefore 
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Figure 5: Keyframes from RHex simulation leaping vertically to a height of 37cm. Blue arrows show contact forces 
while the red arrow shows body velocity. The coefficient of friction is /i = 0.8 and the relative leg timing is t 2 = —0.06s. 
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Hybrid system domain (Def. 2) 
Vector field (Def. 2) 

Guard set (Def. 2) 

Hybrid dynamical system (Def. 2) 
Discrete indexing set (Def. 2) 
Reset map (Def. 2) 

Hybrid time domain (Def. 3) 

Set of discrete transitions (Def. 2) 
Contact word of length N (Def. 4) 
Execution of the system (Def. 4) 


Table 2; Hybrid system and execution symbols, with definition of introduction marked. See also Table 1 for symbols 
introduced in Section 2. 


for the purposes of this paper we will not consider such slip-to-stick transitions. This limitation will, e.g., prevent the 
modeling of a leg that slides upon contact with the ground but gains traction later using static friction. 

3 Hybrid Dynamics 

In this section we first define a general O'" hybrid dynamical system (Section 3.1) that is then instantiated as the main 
object of study for this paper, the self-manipulation hybrid dynamical system (Section 3.2). Section 3.3 establishes 
that this system is indeed a C'^ hybrid dynamical system and Section 3.4 further shows its internal consistency. Finally, 
Section 3.5 shows that Zeno executions of the system accumulate and that the pseudo-impulse truncates certain Zeno 
executions. 


3.1 Hybrid Dynamical System 

In the following definitions we make use of the natural (disjoint-union) topology on the hybrid state space, consisting 
of a collection of manifolds with corners (Joyce 2012, Def. 2.1); see Appendix D or (Burden, Revzen and Sastry 2015, 
Sec. II) for more details. The hybrid system notation introduced in this section is summarized in Table 2. 

Definition 2. A C'^ hybrid dynamical system, r G NU {0°, Co}, is a tuple H := F,27, iF,g,TVj, where the constituent 
parts are defined as: 

1. J := {/,2,... ,K} C N is the finite set of discrete modes; 

2. r G ff X f[ is the set of discrete transitions, forming a directed graph structure over J; 
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Figure 6: Illustration of elements from the C’’ hybrid dynamical system (Definition 2) for the RHex model. 
Note that this is a 5-dimensional model (dimQ = 5), so we cannot faithfully represent the domains and guards 
on the printed page; instead, we illustrate a two-dimensional slice of height z and body pitch (j). From the un¬ 
constrained mode £){} there are three possible discrete transitions corresponding to touchdown of the front leg 
(G{} { 1 } C {(q,q) S FQ : ai (q) = 0}), rear leg (a2(q) = 0), or simultaneous touchdown of both legs. We annotate 
the reset map corresponding to front leg touchdown, but emphasize that there are corresponding maps defined 

over all the guards Gij. Each constrained mode also generally contains liftoff guards (e.g. C D{i}); these are 

not illustrated. 


3. T) ;=IJ/g Di is the collection of domains, where Di is a manifold with corners; 

4. T '.T> ^ TT) is a hybrid map that restricts to a vector field Fj := J-\dj for each I G J; 

5. Q :=IJ(/ yjgj- Gij is the collection of guards, where Gj j C Djfor each {I,J) € F; 

6. TZ : Q ^ F is a continuous map called the reset that restricts as Rj j := TZlcj j ■ Gij —^ Dj for each {I,J) S F. 

Before we proceed, we make a few clarifying comments about this definition. While F is a directed graph it is 
not generally a tree (i.e., {I,J) and (7,7) may both be members). When we write Q Gj j where Gjj C Dj 

for each (7,7) G F, we are simultaneously specifying that (i) each Gjj is an arbitrary subset of Dj and (ii) Q is the 
finite disjoint union of these subsets. The domain F should be regarded as a C'' hybrid manifold as described in 
Appendix D since each Dj is a C'' manifold with corners. In contrast the guard Q does not generally possess a smooth 
structure since each Gi j C Dj is not even required to be a topological manifold. We will say that 77 has disjoint guards 
if Gij n Gi l = 0 for each pair (7,7), (7, L) G F such that J f^L. An illustration of some of the elements of a C'^ hybrid 
dynamical system is shown in Figure 6. 

Roughly speaking, an execution of a hybrid dynamical system is set in motion from an initial condition in F by 
following the continuous-time dynamics determined by the vector field F until the trajectory reaches the guard Q, at 
which point the reset map TZ is applied to obtain a new initial condition. We formalize this using the notion of a hybrid 
time domain. 

Definition 3. A hybrid time domain is a disjoint union of intervals T ^^oh that: 

1 . A G NU{°°},' 

2. 7] C K is a closed interval for all i < N, and ifN < °° then 7)v C M is also a closed interval; 

3. Tj n 7j+i is nonempty and consists of a single element for all i < N. 

Note that an interval may be degenerate, i.e. 7j = {f,}. We define supT := sup|J/Li F- 

This definition is essentially equivalent to the hybrid time trajectory (Lygeros et al. 2003), the hybrid time set 
(Collins 2004), and the hybrid time domain (Goebel and Teel 2006), and enables us to formalize the conceptual 
description of the domain of a hybrid execution from Back et al. (1993) as being divided into contiguous epochs 
separated by events where the reset map is applied at an instant referred to as an event time. Furthermore, this definition 
has two appealing features. First, an execution (defined below) becomes a smooth (hybrid) map defined from a 
hybrid time domain T into the continuous state space F of the hybrid system, avoiding the use of set-valued maps or 
cumbersome left- or right-handed limits; see Appendix D for the definition of smoothness for hybrid maps. Second, it 
treats the model of time in the same class of mathematical objects as the model for the state space, namely, a disjoint 
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union of smooth manifolds with corners. Note that under this definition a transition time f, £ 7] fl 7]+i appears in two 
consecutive components of the time domain 7) and TJ+i, allowing the flow on each interval to include both endpoints. 
Also note that this allows for two transitions (or more) to occur at the same instant in time, e.g. it is possible that 
7] = {h} ,7;_i n 7] n 7}+i n ... = {?,}; the middle portion of the trajectory would have been excised from a left- or 
right-handed definition of execution, or potentially muddled with the surrounding trajectory portions in a set-valued 
definition. 

Definition 4. An execution of a hybrid dynamical system 77 = {ff ,Q ^TV) is a smooth map x - T ^F> over a 

hybrid time domain T 7} satisfying: 

1. V/ £ N,! < N: iff is not a degenerate interval then ^x|7;(0 = faf till t £ 7j-; 

2. V i <N: for{ti} = 7jn7j+i (f/ze event times), we have xWiiti) £ G, =X|7;+, (h), and for all s £ 7j\{f,} 

we have ; 7 | 7 ; (^) ^ G- 

The execution has an associated word denoted by cy{x) — Wi}f=i ^ 77"^ that specifies the sequence of discrete modes 
encountered by the execution: xWi F Dj. for all i £ N, / < N. An execution X ■ V is maximal if it cannot be 
extended to an execution over a longer hybrid time domain. We say^^ that 77 is: deterministic if for every initial 
condition x £ 77 there exists a unique maximal execution X - T ^F> such that x\ti (0) = x; non-blocking if for every 
initial condition x £ 77 and any maximal execution J : T ^ 77 with x\ti (0) = x, then with T =IJiLi 7) either N = °° 
or N < °° and = [fAr,°o). 

The contact word, Ct{x)^ also called the contact motion plan, is useful for comparing and reasoning about different 
executions of the hybrid system (Xiao and Ji 2001). 

3.2 The Self-Manipulation System 

While the previous hybrid system specification is very general, it is useful to instantiate it for a model of a physical 
system. This section will define the self-manipulation system (Johnson and Koditschek 2013a) (and by the analogy of 
that paper, equivalently a manipulation system (Murray et al. 1994), as summarized in Section 2.2), where the discrete 
mode, I C 1C, corresponds to the active contact contact mode. 

Definition 5. A self-manipulation hybrid system is a C'^ hybrid dynamical system, 77^ = (77,F,77,7^,7),7^), defined 
as follows, 

3.2.1 Discrete Modes 

The set of modes, all physically permissible combinations of contact constraints, is given by, 

J={7£2'^:a,(;'(O)^0Aa(4)c7}, (71) 

that is there are two requirements: 1) there must exist some point, q £ Q, such that these normal contact constraints 
are active, a/„ (q) = 0, and 2) no tangential constraint is included whose corresponding normal constraint is not also 
included, i.e. $i Gf : a{i) ^ 7. 

3.2.2 Edges 

The set of edges is made up of any pair of modes whose union is also a mode - in other words, sets arising from the 
intersection of the two base sets that satisfy respectively the two sets of normal constraints, 

r = {{I,J)GjxJ:IfJ,IUJGj}. (72) 

This set of edges can be further restricted based on the guards, defined below, as there will be some transitions 
(7,7) £ r where no points in D/ satisfy the requirements of the guard, i.e. Gj j = 0. In that case we can reduce the 
edge set further, 

f = {iI,J)Gr:Gij^0}. (73) 

Following Lygeros et al. (2003). 
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3.2.3 Domains 


The domain associated with a contact mode / G is the subset of the ambient tangent bundle T Q that satisfies the 
normal non-penetration and tangential non-sliding constraints, 

Di={iq,q)^TQ: a/„ (q) = 0, aic„ (q) > 0, A/(q)q = 0}, (74) 

where recall that Q : = 0 x ^^(d) is the joint space combined with the position space of the body. 


3.2.4 Flows 


The vector field on each domain is based on the self-manipulation dynamics for q, as in (12) and (Johnson and 
Koditschek 2013a, Eqn. 33, 72), are. 


^/(q,q) 


q, M\Y-Cq-N)-A]'^Aiq 


(75) 


for the coordinates in Qj, and recall from Assumption A6 (unconstrained massless limbs) that the coordinates asso¬ 
ciated with unconstrained massless limbs lie in the subspace Qj and evolve according to the vector field F, such that 
the combined vector field is complete over all of T Q. The control input T G T*& that appears in T is prescribed by a 
C'^ function of state T G C''{TQ,T*Q) (for example a fixed-voltage motor model T, = KpKc{l — KqQi) (Johnson and 
Koditschek 2013a, Sec. 1V.C.4)). 


3.2.5 Guards 

We find it convenient to construct the guard set, for mode /, G/ C £>/, as a union of subsets indexed by its “outgoing” 
edges, (7,7) G F, using the touchdown predicate (22) and the complementarity problem predicates (58) and (38) 
specified as^*'. 


Gu = {Tq e D/: NTD(Tq) ^ PIV(7,Tq), 

(76) 

^NTD(Tq) ^FA(7,Tq)|. 

(77) 


Conceptually, the component of the guard for mode I associated with edge {I,J) consists of any base states, q, at 
which any new touchdown event can occur from mode I into mode J, according to the NTD predicate, (22), subject to 
PIV complementarity, (58). An additional condition on the base and tangent states, Tq, is that if no new contacts are 
touching down (“liftoff”), then FA complementarity holds (38). 

The outlet set in domain /, defined as Gj := UjGjj, is. 


G/^^TqsD/: V a,(q) ^ 0 U/ VUK^/) ^ 0 


KkeK„\I 


^iei 


(78) 


The outlet set will be used in the proof of Theorem 7, but more importantly provides a computationally expedient 
method of simulating an execution: first check if Tq G Gj, then determine the subsequent mode afterwards. 


r CPpiv(Tq), NTD(Tq) 

\ CPFA(Tq), -NTD(Tq). 


3.2.6 Reset Maps 

The reset map associated with edge (7,7) G F (taking its domain exactly on Gjj, defined above) is. 


^/j(q,q) = [q, q-Aqy]= q, q-A^Ajq 


(80) 


Note that for takeoff events, Jn C /„, the prior velocity will already agree with the new contact mode and therefore the 
impact map will have no effect. 

Note that the requirement that {1,J) € P and therefore J ^ J ensures that all tangential constraints in the new contact condition must have a 
matching normal constraint also (or trending so). Furthermore, note that we will shortly observe in (73) that only some of these outgoing edges 
make a non-empty contribution, Gy,y, to this union. 
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3.3 The Self-Manipulation System is a Hybrid System 

This section will show that. 

Theorem 4. The self-manipulation system (Def. 5) is a C® hybrid dynamical system (Def. 2). 

Proof. Definition 2 has a number of requirements and so we will break this proof up into the constituent parts and 
show that each component of Definition 5 is compatible with the requirements. 

1. in (71) is a finite set, the only requirement on ff. 

2. r in (72) is a subset of J x J hy construction, and F in (73) is a subset of F. 

3. By Assumption A2 (simple constraints), for d\\ I £ J the maps a/„ G C“(Q,C/„) and A/ € C®(7’ Q,TC/) are 
constant rank, and therefore each D/ C TQ, as defined by these functions in (74) is a closed C“ manifold with 
corners (Lee 2012, Thm. 5.12), (Joyce 2012, Def. 2.1). The Nash Embedding Theorem (Nash 1966) implies that 
Q can be embedded analytically in Euclidean space of sufficiently high dimension; this embedding therefore 
restricts to an embedding of the submanifold D/. 

4. Eor all (q,q) € Dj, F/(q,q) € Ijq qjD/ for F/ given in (75) (based on (12), (Johnson and Koditschek 2013a, 
Eqn. 33), which enforce the equality constraints of the definition of the domain, (74), and therefore lie within 
TDj) and hence we may write Fj G C“{Di,TDi). 

5. Gij in (76) is a subset of Di by construction. 

6. Eor the reset map in (80), lmRij{Gij) C Dj, as the domain D/ has three requirements (74): 

1) a/„ (q) = 0, i.e. any normal constraints are touching the surface indicated. Eor pre-existing constraints, {/: 
i G In ny„}, this requirement is already guaranteed, i.e. q G Gij C Dj ^ a/,, = 0, and the reset map does not 
alter the base coordinates q. New normal constraints in mode J, {j '■ j G J„\In}, satisfy this requirement by the 
touchdown predicate in the guard (76) (and (21)), where TD( 7 ,Tq) is true only when ay(q) — 0. 

2) aicM) > 0, i.e. all base constraints are non-negative. Again since the reset map does not alter the base 
coordinates q, then q G G/j C Dj ^ ajc„ > 0. 

3) Ay(q)q = 0, i.e. any velocity in constrained directions is zero, but this is guaranteed by the reset map as 
Ayq+ = A/q^ — AyA^ Ayq^ = 0. Therefore, as claimed, the image of the reset map (80) lies within Dy. 

□ 

3.4 Consistency Properties 

This section establishes several additional properties of the self-manipulation system that are of practical importance. 
Theorems 5-9, which we shall for convenience collectively call consistency properties. 

Theorem 5. The self-manipulation system (Def 5) has disjoint guards. 

Proof. The disjointedness of the guards follows directly from the assumption of uniqueness of solution for the con¬ 
stituent complementarity problems. Define the liftoff predicate where no contact is touching down, 

LO(/,Tq) := - NTD(Tq) A V A 0^ , (81) 

and so by a refinement of the complementary block of the partition defined on the right hand side of (79), 

r CPpiv(Tq), NTD(Tq) 

CPG(/,Tq) = <^ CPFA(Tq), LO(/,Tq) (82) 

I /, otherwise, 

is equal to either the unique mode J for which Tq G Gij, or simply I if the state is not in a guard, Tq ^ G/ yV {I,J) G 

F. ’ ’ □ 

In their most general formulation, hybrid dynamical systems can accept executions that terminate before infinite 
time (continuous or discrete) has elapsed, or accept multiple distinct executions from the same initial condition. This 
behavior is undesirable in practice, and inconsistent with our experience on real manipulation and self-manipulation 
systems. Necessary and sufficient conditions (Lygeros et al. 2003, Lems. III.l, III.2) have been formulated that ensure 
a system is deterministic and non-blocking. Since these conditions are applicable to a general class of hybrid dynamical 
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systems, they can be difficult to verify directly for particular classes of models. No previous authors have established 
that these conditions hold for any broad class of hybrid system models for Lagrangian dynamics subject to multiple 
unilateral constraints, much less with the particular structure of the self-manipulation system. The conditions in 
(Johansson et al. 1999, Lem. I & 2) come closest, as they would apply to an instance of a self-manipulation system 
exhibiting only a single constraint. 

To serve the needs of the present paper, we introduce an extension of the line of reasoning in Lygeros et al. (2003) 
establishing that the self-manipulation system. Definition 5, is indeed deterministic and non-blocking. Definition 4, in 
the presence of an arbitrary number of unilateral constraints. 

Theorem 6. The self-manipulation system is deterministic. 

Proof. Assumption A7 (Lagrangian dynamics) imposes a partial flow on each component Dj, hence continuous tra¬ 
jectories are unique and nondeterminism could only be introduced through a reset. But the definition of execution, 
Def. 4, implies that a discrete transition occurs at Tq G D/ if and only if there exists J G J\ {/} such that Tq G Gi j. 
Since the guards are disjoint by Theorem 5, there is at most one guard containing Tq. The execution continues from 
/?/,j(Tq)GDy. □ 

The non-blocking property is a bit more subtle as the self-manipulation systems escape some of the structure 
required to handle the more general class of systems addressed in Lygeros et al. (2003) and used there to establish 
conditions for non-blocking. For self-manipulation hybrid systems the non-blocking property arises from the discrete 
logic and continuous dynamics in an essential manner that we now rehearse informally in preparation for the statement 
and proof of Theorem 7. The guard, G/, intersects the corresponding domain, D/, both on the boundary of the domain 
(to handle impact on an erstwhile inactive constraint) as well as in the interior of the domain (to handle a sign change 
on some active constraint’s contact force). An execution might be blocked by conventional finite escape, i.e., if the 
continuous flow brings some initial state to the boundary of the domain at a point in the complement of the guard in 
finite time. Alternatively, it might be blocked by hybrid ambiguity, i.e., if the continuous flow brings some initial state 
to some point that is in the complement of the guard but still lies on the boundary of the guard, for this would violate 
the semantics of execution that restricts continuous flow to closed intervals (formally, Def. 4 requires a minimum - 
not merely an infinum - time of entry into a guard). In the following proof we will preclude both cases by showing 
that the guard contains all points reached by the continuous flow that lie in the boundary of the domain or the interior 
of the domain but the boundary of the guard. 

Theorem 7. The self-manipulation system is non-blocking. 

Proof. Recall from part 6 of the proof of Theorem 4 that the image of the guard set under the reset map is within the 
domain (and thus the discrete transition will never be blocking), and from Assumption A7 (Lagrangian dynamics) that 
the flow is forward complete over T Q. Therefore we need only check that the flow will only reach the boundary of 
the domain or the boundary of the guard within the domain interior at a point which is included in the guard. 

Recall from (74) the definition of D/, and note that it is a subset. 


D/C 


Tq G Q : /\ ak{q) > 0 
kelC„\i 


(83) 


where furthermore. 


Aa,(q) = 0 A AA/(q)q==o 


V'64 


uei 


(84) 


Note that the constraints in (84) are invariant under the flow of (12), whence under the completeness assumption 
(Assumption A7) it is only possible to flow out of D/ in forward time by violating one of the inequality conditions 
of (83). 

Recall from (78) the union of all guards, G/, and then using Lemma 1 note that the closure of the union of all 
guards is. 


Gi 


Tq G D/: \/ a,(q) < 0 V VU^^/) < 0 

I \keK\i / \iei } 


(85) 
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Now consider an arbitrary point in the domain, Tq G D/. If fl(Tq) > 0 for all ji G {^k}kelC\i 
Tq is on the interior of Dj\Gj and it is possible to flow for positive time while remaining in the domain Dj and not 
reach a guard Gj. Otherwise there exists k G /C\/ such that ai;(Tq) = 0 (i.e., the state has reached the boundary of the 
domain) or there exists i G I such that U,(A/(Tq)) < 0 (i.e., a sign change on some active constraint’s contact force). 
We now consider the two (mutually exclusive) possibilities concerning whether a contact condition or an active force 
is trending negative: 

1. a<;(Tq)0 for all k G /C\/ or U, (/l/) ^ 0 for all i G /; 

2. there exists k G /C\/ such that a,t(Tq) ^ 0 or there exists i G I such that U,(A/(Tq)) -< 0. 

In case 1), when there is neither a negative trending contact nor active force, then it is possible to flow for positive time 
in the domain without intersecting any guard or leaving the domain (Lemma 3); this provides the unique extension to 
the execution. The contrary case 2) is just the situation the hybrid system’s logic is designed to flag: i.e., Tq is in a 
guard, (78), so the application of the reset map provides the unique extension to the execution. 

Therefore every initial condition Tq G D yields a unique execution defined over a hybrid time domain that spans 
infinite time (continuous or discrete), whence the self-manipulation system is non-blocking. □ 

The self-manipulation hybrid system may undergo multiple hybrid transitions in succession at the same time f, as 
there is no “dwell time” requirement to continue after reset under the continuous dynamics for any minimum amount 
of time. Therefore it is important to bound the number of such multiple transitions to ensure that the continuous 
execution will eventually continue over an open interval of time. Here, Theorem 8 relates the image of the reset map 
to the guard sets to show that continuous execution continues after at most two successive hybrid transitions. 

As a simple example, consider the self-manipulation system model consisting of a point mass in a gravitational 
field that points away from a constraint surface (i.e., a ball under a ceiling). If the mass is initialized with a velocity that 
causes it to impact the constraint surface, it will transition first to the constrained (ceiling) contact mode through an 
impact that ensures zero relative velocity. After spending zero time in the constrained mode, and therefore at the same 
continuous time, the system will transition again back to the unconstrained mode as the mass succumbs to gravity. The 
execution continues in the unconstrained mode as the mass accelerates away from the ceiling. In the self-manipulation 
system these are treated as separate transitions. At the expense of a small amount of additional bookkeeping in the 
definition of execution, we dramatically simplify the specification of the reset map (in this example, eliminating the 
need for a reset map from the unconstrained mode to the same unconstrained mode consisting of impulses from a 
constraint not present in either the original or destination mode). 

Theorem 8. An execution of a self-manipulation hybrid system without massless limbs may undergo no more than 
two hybrid transitions at a single time t. 

Proof The guards are partitioned into two types by the new touchdown predicate, NTD(Tq), (22), into touchdown 
and liftoff (non-touchdown) components. It suffices to show simply the reset map (i) takes states that are in touchdown 
guards to either non-guard states, or states in a liftoff guard, and that (ii) the reset map always takes states in the liftoff 
guard to non-guard states. 

To show (i), note that for all points Tq in a touchdown guard, (76), the impulse-velocity complementarity ensures 
that all constraints k that are touching but are not in the outgoing contact mode J have a separating velocity, > 0, 
after the application of the reset map, if such a velocity is well defined (52) (as in Theorem 2). Therefore all constraints 
k not in J must either not be touching (ai,(Tq+) > 0) or have a separating velocity (A<.q+ > 0), but therefore cannot 
satisfy the touchdown predicate, TD(k,Tq+), (21), and so NTD(Tq+) is false and the state after the reset map is either 
not in a guard or is in a liftoff guard. 

To show (ii), note that for all points Tq in a liftoff guard, no contacts are touching down, i.e. V k G /C„,TD(k,Tq) 
is false. The force-acceleration complementarity problem that defines these guards, FA(y,Tq), (38), does not depend 
on the active mode, I. Furthermore, the reset map /?/./(Tq) is simply identity, and so the state remains the same after 
this transition, Tq = /?/j(Tq). Therefore Tq G Gi.j such that NTD(Tq) is false and FA(y,Tq) is true implies that 
NTD(/?/. 7 (Tq)) is also false and that FA(y,/?/./(Tq)) is still the correct solution, and the state after the reset map is 
not in any guard. 

Therefore at any given time f, the system can undergo at most two transitions - first if some contact is touching 
down the system will undergo an impulsive transition, and then if some contact force is trending negative it will 
undergo a smooth liftoff, but no further transitions are possible at that contact mode and state. □ 

Finally, the last consistency property considers the dynamics of the discrete modes. A general C'' hybrid system 
whose domains are intersecting subsets of some ambient domain (as is true for the self-manipulation hybrid system) 
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need not define a unique execution for a given state from any appropriate mode, even if the hybrid system is determin¬ 
istic and non-blocking. A given state Tq G Q will in general be a member of more than one domain, such as the corner 
point in purple from Figure 6 or, for an arbitrary point, any subset of the current constraint mode. As such, we must 
ensure that the evolution of the system will not be biased by ascribing the “incorrect” mode to that state, nor capable 
of supporting more than one word (discrete mode sequence) over a given sequence of continuous time trajectories 
associated with an execution. 

Theorem 9. From an initial state Tq at time to and any contact mode I consistent with that state, i.e. / G := {/ G 
J ; Tq G £>/}, the execution will be uniquely defined (in both state and contact mode) for all t > to after undergoing 
up to one hybrid transition. 

Proof. The proof considers in turn the two mutually exclusive cases defined by the truth or falsity of the predicate 
NTD(Tq), (22), and in each case the execution will be uniquely defined due to the uniqueness of the corresponding 
complementarity problem. 

If NTD(Tq) is true then there is some additional constraint j that is impacting, i.e. TD(j,Tq), (21), where note 
that y I & Jq,i ^ I. Therefore from any consistent mode I G J(^, Tq is in some guard Gpj, (76), determined by 
PIV complementarity, J = CPpiv(Tq), (58). Since the reset map depends on J but not /, and J is unique by the 
impulse-velocity complementarity assumption (AlO), the execution will continue from the unique point (Tq)). 

If NTD(Tq) is false, then for any / G J7q the system could be in a liftoff guard. Consider the mode J = CPFA(Tq), 
(38), uniquely defined for a given Tq by the force-acceleration complementarity assumption (A9). If / = 7 then the 
state is not in a guard and therefore no reset map will be applied. Otherwise Tq G G/y, (77), and the system will 
undergo a hybrid transition, though recall that liftoff reset maps are the identity map. In either case the execution will 
continue from the unique point (7, Tq). □ 

3.5 Zeno 

An execution for a hybrid system is referred to as Zeno if it undergoes an infinite number of discrete transitions in 
finite time (Lygeros et al. 2003, Def. II.3). 

Definition 6. An execution X : T — ^ 'D for a hybrid dynamical system PL = (ff,T,'D,F,Q,R) over a hybrid time 
trajectory T =IJ^i f is Zeno ifN = °° and^fLi l^'l <°°- 

Zeno executions need not accumulate in a general hybrid system, that is, the limit lim,^supTZ(0 be unde¬ 
fined (Zhang et al. 2001, Def. 6). However, Lagrangian systems subject to unilateral constraints give rise to unique 
trajectories defined for all time (Ballard 2000, Thm. 10). We show in Section 3.5.1 that the self-manipulation hybrid 
dynamical system (Def. 5) models this property through the mechanism of Zeno executions accumulating on a unique 
limit in the ambient space T Q, from which the hybrid execution proceeds through the next smooth component (and so 
forward, continuously, through ambient time). Then in Section 4.4 we discuss extensions and connections with results 
in the literature. 

3.5.1 Accumulation of Zeno executions 

In the following Theorem, we rely on several results originally obtained using sophisticated measure-theoretic tech¬ 
niques (Ballard 2000). At the expense of additional notational overhead, we reproduce the necessary arguments in our 
hybrid system framework using elementary mathematical machinery. 

Theorem 10. Given a self-manipulation hybrid system with a complete connected configuration manifold Q, if the 
inertia tensor M is non-degenerate and the forces abide by the bound in (19), then the projection of any Zeno execution 
X :T into the ambient state space T Q, no x '.T TQ, accumulates on a unique limit, 

(q,q):=4imr?^oz(0- (86) 

Proof. Let Z ■ ^ —?> 27 be a Zeno execution over the hybrid time trajectory T =IJ^i f. With 7) fl 7)+i = {f,} for all 
i G N, let t = supT < °°. For notational convenience in this proof we will let 7] = [tj^iff] for all i G N. This notation 
is somewhat redundant since = tf G M for all i G N; we use it to signify that tf G 7] and G 7j+i. Note that 
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=0 since = t. When there should be no confusion as to the index of the time domain, we 

will abuse notation by supressing the index and simply write q(f) instead of q(/,f) for t G 

Let n ^ TQhe the canonical projection that simply removes the label from the disjoint union 1) =Wiej Di, 
and let (q,q) := denote the velocity and position components of the execution X- Note that since the reset 
map, (80), does not change the position, q, the position trajectory q : T —?► Q satisfies, 

q|7;(fi) =q(fr) = = (87) 


for all i G N, i.e. positions evolve continuously with respect to time. Therefore q uniquely determines a continuous 
curve q : r —Q over the half-open interval T = IJIli Ti = [0,F). The restriction qlj; is continuous for every i G N, 
therefore it uniquely determines a right-continuous curve ^q,q^j '■ T -G- TQ. The bound in (19) ensures that the 
velocity is bounded on finite time horizons. 


V := sup 
teT 



< 


( 88 ) 


as shown using a sequence of standard results in Appendix E.l (and adapted from the proof of (Ballard 2000, 
Thm. 10)). For any nondecreasing Cauchy sequence, C T such that s, —?> 7, the sequence {q(si)}Jli is also 

Cauchy since. 


^ __ r^n 

Vn,OTG N;%(q(s„),q(sm)) < / 

Js,„ 


q^(«) 


_ds < V |i„ — s„ 


M 


(89) 


Therefore the position tends to a unique limit in the ambient state space, i.e. the following limit exists: 


q:=limq(f). 


Under the simple constraints assumption (A2), the Rank Theorem (Lee 2012, Thm. 4.12) ensures there exists a coor¬ 
dinate chart (y, I//) near q where A^; = [ld,0]. Continuity of q ensures there exists t_GT for which q([L7)) C V . 

Let g{x) = C J denote the sequence of discrete modes visited by % and let m := min{! G N : f; > f}. 

Specializing the definition of execution of a hybrid system to the self-manipulation system and performing integration- 
by-parts as in Appendix E.2 we conclude that in coordinates, 

V/>OT,f G [ttvh] :M(q(f))q(f)-M(q(f+J)q(f+j) 

= ^ (^Y(q,q)-N(q,q)-C(q,q) - Ay,(q)^Ay,(q,q)^ <7^, (90) 

M(q(f+))q(f+) -M(q(fr))q(fr) = -P,,, 


where is defined in (24) and for each f G {1,..., q} the f-th coordinate of the covector C G T*Qis given by. 


^ ^ Ait.:.; 

Recursively substituting using (90) and (87), for any f € l^m ,t) with m : = max{! G N : f < f,- < f}, the velocity compo¬ 
nent of the execution (q,q): T ~^TQ satisfies. 


ds 


M(q(f))q(f) - M(q(f«))q(y 

= (^Y(q,q) -N(q,q) -C(q,q) - Ay_(q)^ly_(q,q: 

+ E / fY(q>q)-N(q,q)-C(q,q) -Ay,(q)^ly,(q,q)') c/s-Py, 

i=m \ / 


(91) 


Noting that for all time t G (fyLiT,- ) on the interior of each time interval i > m that q(f) = q(f) and q (f) = q(f). 
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we conclude that for all t G [tm,t) the right-continuous velocity 


q 


:T ^TQ satisfies. 


M(q(f))q^(0-M(q(fm))q^(fm) 

= / Y(q,q^)-N(q,q^)-C(q,q^)£/i- fAj-{q)^Xj-{q,q^)ds 


i [/". 

i=m 


Ay,(q)%(q,q 


( 92 ) 


This equation, (92), is the transcription of (Ballard 2000, Eqn. 36) into our formalism. 

Recall that in coordinates (V, \j/) we have Ajc = [ld,0] and that U(P) >0^P<0, U(A)h0=J>A<0. The 
complementarity conditions, (37) and (49), thus ensure that each component of AjXj and Py are non-positive for 
each J G J. 

We conclude by rearranging (92) (and suppressing dependence on q and q ) and invoking the bound from Ap¬ 
pendix E.3 that there exists a, j3 G R such that for each j G {1,..., |/C|}, 


Xl,ds + Pj. 


0 <E I K 

i=m L-''/-! 

< a + )3(f-f„). 


7' 

Jtw 


Xj ds = — 

dm 




'(f)q+(f)-M7r^)q+(fjl + fv 


- - ads 


(93) 


Therefore the infinite sum. 


t \f ‘ Hds + P^^ 

i=m 


exists and is finite by the Monotone Sequence Theorem (Eolland 2002, Thm. 1.16). Thus each coordinate of each term 
in (92) tends to a unique limit as f —F, i.e. the following limit exists: 

q := limq^(f). 

t^t 


□ 


Let Z GL J denote the set of modes visited infinitely often by %. Since the sequence o{x) = {■4}7=i C of 
discrete modes visited by % is an infinite sequence of elements taken from a finite set, Z 

Although the previous result guarantees the continuous state of the system associated with a Zeno execution has a 
well defined limit, we must further guarantee that it limits on a consistent mode as well. The following result guarantees 
that this limiting state is indeed achieved in a well defined mode which is also physically meaningful in the sense of 
being composed of any and all of the constraints that had been active infinitely often during the Zeno execution. 

Corollary 1. LetZ = \^Z denote the set of all constraints visited during a Zeno exection X - T —> D. The set is a valid 
mode, representing the asymptotic contact mode, and the zeno limit (q,q) from (86) lies in the domain D^, i-e., 

zg J, X ■= (qjq) G£>z- 

Proof We continue with the notational conventions from the Proof of Theorem 10. Eor all Z G Z, let Wz = 
A|(AzM^‘a|)-'Az and note: Wz = W| > 0; WzM^'Wz = Wz; q(f7) = Id - Vy,)q(fr) for all i G N; 
and 3 Sz such that (AzM = S^Sz. Impacts do not increase energy since for all i G N: 

^q(fr)^Mq(fr) -q(f+)^Mq(f+) =\q{ti fMq{tr) - iq(fr)^(Id-Wy,M-‘)M(Id-M-‘Wy,)q(fr) 

= ^q(fr)^(M-M +Wy, -f Wy, - Wy,M^ Vy,)q(fr) 

=^q(fr)^(2Wy, - Wy,)q(fr) = ^q(fr)^Wy,q(fr) > 0. 

Equation (88) implies impacts must extract a finite amount of energy, 

£q(fr)^Wy,q(fr)<-. (94) 

i=i 
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and hence in particular, 


(95) 


limq(f,. )^Wy,q(f,. )=0. 

l-^oo 

Taking (95) together with, 

q^Wzq = q^A|(AzM-'A^)-'Azq = ISzAzql^j, 
implies each Zeno constraint is asymptotically satisfied; 

Vz G Z = MZ : liinA,q(f) = 0. (96) 

For all constraints j G Z,j G Zj for at least one Zj G Z visited infinitely often in By the definition of Dzp 

Vz G Z : lima,(q(f)) = 0. ( 97 ) 

and therefore j G * 7^ 0- Furthermore for all constraints j GZ and all modes Zj G Z containing y, the definition of 
mode Zj, (71), requires that a ( 7 ) G Zj C Z. Therefore Z G by (71). The domain has three requirements, (74), 
two of which we have already shown to be met by % in (96) & (97). Finally, as in (97), for all constraints k G ICn, and 
all modes Z G Z, by definition of Dz, 

V^G/C„ ;lima^(q(f)) >0. ( 98 ) 

Thus^ = (q,q) GDz. □ 

3.5.2 Effect of pseudo-impulse on Zeno executions 

As suggested in Section 2.7, the inclusion of the pseudo-impulse prevents an infinite number of liftoff transitions in a 
finite amount of time from constraints impinged upon by external forces. 

Theorem 11. Let j : 7^ — > 27 7>e a Zeno execution of a self-manipulation hybrid dynamical system with exactly two 
contact constraints, so that the limiting set Z = J. Under the hypotheses and notation of Theorems 10 & Corollary 1, 
when the pseudo-impulse parameter is positive, 5t > 0, we conclude that, 

VzGZ;U,(^A|(q)(^T(q,q)-C(q,q)-N(q,q))) <0, (99) 

that is, the constraint forces cannot be positive for either constraint at the Zeno limit point. 

Proof We know lim,^, A 2 q(f) = 0 for all z G Z by (96). When the liftoff velocity drops below the threshold given 
implicitly by Theorem 3, the pseudo-impulse prevents liftoff from constraint i G Z if it violates (99) (i.e., meets the 
condition ( 66 )). Therefore, the contact force must be negative for both constraints z G Z that undergoes an infinite 
number of liftoff transitions. □ 


4 Discussion 

In this section we explore the limitations in physical scope incurred by the twelve assumptions of Section 2, and discuss 
some consequences of the results of this paper through additional examples. Specifically, Section 4.1 reviews on a 
conceptual level the meaning and implications of the original assumptions. Then Sections 4.2 &. 4.3 use a number of 
specific physical examples to elucidate the nature and origin of the more conservative restrictions that are helpful but, 
we speculate, not necessary to achieve the results of interest in Section 3. Finally, Section 4.4 explores the relationship 
of these assumptions to the Zeno results of Section 3.5. 
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4.1 The Base Assumptions 

Most of the assumptions listed in Section 2 are quite common in the modeling of physical systems, especially for 
models focused on robotics. These limit the scope of the physical settings of interest, and while there are certainly 
examples of robots that would be poorly modeled by each (many of which are explored in Section 1 .2), we believe 
that there remains a large class of systems that are covered by most if not all of these. Specifically, this class at its 
core consists of rigid bodies (Assumption Al) under Lagrangian dynamics (A7) with analytic (A3), independent (A2) 
constraints. That these constraints persist (A4) and are added through plastic impact (A8) are certainly domain specific 
assumptions, but many robotic tasks involve touching an object or the environment with the goal of continuing that 
contact in order to do some work. 

With these assumptions in place, solving a complementarity problem (A9 & A10) is the most direct and mathe¬ 
matically tractable way to formulate the change in contact conditions and is in step with a large literature (reviewed in 
Section 1.2.3). Our insistence on unique solutions to the these problems, key to the consistency conclusions of Sec¬ 
tion 3.4, has poorly understood consequences except for the case of independent plastic frictionless contacts for which 
these assumptions are known to hold. The unique structure of the complementarity problems used here allows for the 
inclusion of other assumptions (in particular massless limbs and the pseudo-impulse, A6 and All), and Theorems 1 
& 2 (along with Lemma 9) ensures that this form of the complementarity assumptions agrees with the more common 
versions. 

Assumption A7 is imposed both in the interest of the physical scope (Lagrangian dynamics) and mathematical con¬ 
sistency. It arises from the same motivation as the familiar conditions that preclude finite escape in classical dynamical 
systems but must nevertheless be couched in more technically involved language because of the hybrid setting. Thus 
we have found it expedient to provide further analysis of what is left behind; the results of Lemmas 6 and 7 guarantee 
the admissibility of most physically interesting problem instances, but bar (for reasons reflecting the need for a more 
technical framework, we suspect, rather than mathematical necessity) only the case of nonholonomically constrained 
massless links. 

The remaining assumptions are not imposed to facilitate the definitions and consistency proofs underlying the 
formal hybrid system (Section 3), but, rather, relate to the practicality of the physical models they can support. We 
have found the following simplifying (and, strictly speaking, physical fidelity diminishing) assumptions critical to 
not merely the mathematical tractability but also the qualitative accuracy of the models we use in the robotic settings 
of interest (as exemplified by the illustrative cases explored below). Thus, the formal results of this paper have 
been adapted wherever possible to allow for their consistent inclusion. In particular the massless leg assumptions, 
A5 and A6, are sometimes made for mathematical tractability, but often are not analyzed carefully. Briefly, A5 is 
tantamount to the assertion that the Lagrange D’Alembert formulation of constrained mechanics should admit smooth 
generalized coordinates relative to which the kinetic energy is nonsingular, while Assumption A6 similarly requires 
that any massless degrees of freedom not in contact be assigned some reasonable dynamics. Lemmas 4, 5, & 8 and 
Theorems 1 & 2 all concern the inclusion of massless limbs with the other assumptions, and the theorems of Section 3.4 
ensure that the resulting system is consistent. 

Friction in various forms is a common modeling assumption, however the specific setup in Assumption A12 (which 
divides contacts into either completely slipping or completely sticking but precludes sliding-to-sticking transitions) 
consists of common components but in a very restricted manner. This particular combination will evidently not 
be the best model of friction for many systems. The existence and uniqueness of solutions to the corresponding 
complementarity problems for this setup has never been demonstrated and therefore is simply assumed in this paper. 

Finally, the practical and theoretical implications of the new pseudo-impulse model assumed in Al 1 have a com¬ 
plicated interplay to whose exploration we devote Section 4.3. 

4.2 Massless Limbs 

One common set of circumstances that satisfy the requirements of the massless limbs assumption (A5) arises when 
only the robot’s most distal link (the finger, lower leg, foot, or in the case of RHex, the entire leg) is massless and the 
motion of its most distal point is completely constrained when it is on the ground. Although the rank requirement is 
not limited to this setting, it represents the immediate motivation for our inquiry. 

Though there are no truly massless limbs, computing the dynamics using (12)-(13) is numerically more stable 
than inverting Mg in the presence of large disparities in limb segment masses (Holmes et al. 2006, Sec. 4.3). This is 
evidenced by an order of magnitude improvement in the condition number (ratio of largest to smallest singular values) 
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Figure 7: Left: A point sliding along ground G approaches hill H. Right: Free body diagram showing impulses at 
point of contact. Without P no positive impulse from the ground Pg is possible for any initial momentum Mq^ and 
any hill slope 9 < 90°. 
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Figure 8: A rocking block (height h = 10cm, width w = 5cm, mass m = 5kg) settling on the ground. Top Row: 
Without pseudo-impulse (5; = 0). Bottom Row: With pseudo-impulse (5, = 0.03). The execution is identical until the 
last frame. 


for the RFIex model used here (Johnson 2014, Sec. 5.1.1). 


4.3 Pseudo-Impulse 

As a simple example that motivates the need for the pseudo-impulse (Assumption All), consider a point sliding on the 
ground as in Figure 7, which hits a hill at some slope 9. The contact impulse from the hill P// will cause the particle 
to break contact with the ground and leave with some velocity sliding up the hill. This is true for any initial velocity, 
no matter how small (Lemma 10), and any 9 < 90°. With a pseudo-impulse P acting in the direction of gravity, (56), 
Theorem 3 states that there are initial conditions that result in the point coming to rest with impulses from both the 
ground and the hill (i.e., all impulses are positive and sum to zero in Figure 7). Note that in this case all quantities 
scale linearly with mass and as such the solution is the same for any mass. 

To see how the pseudo-impulse resolves Zeno executions, consider the “rocking block” example of a rectangular 
rigid body in Figure 8 of width w, height h, mass and inertia 4 (where if a uniform distribution is assumed 
4 = mi,{w^ + h^)/\2), as studied in e.g. Housner (1963), Lygeros et al. (2003), McGeer and Palmer (1989), Yilmaz 
et al. (2009). As it is falling onto the ground if a corner (labeled “1”) is touching down^^ then the normal direction 
impulse at that corner when the other corner (labeled “r”) hits the ground is. 


/px ^ z{2Ih + mhiw^-h^)/2) 


( 100 ) 


(note that by convention a positive velocity z is one that is towards the ground) and the required impulse will be 

In this example the contact points are assumed to resist sliding friction, although when they are both in contact with the ground one of the 
redundant tangential constraints is dropped. The phenomenon of interest occurs equally well with frictionless contact however the analysis is 
simpler in the frictional case as presented here. 
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Figure 9: Comparison of the vertical velocity of a settling block for evaluations with and without the pseudo-impulse. 
The execution is identical until the impact at f = 0.27s. The pseudo-impulse implicitly bounds the vertical velocity 
such that an impact at speeds lower than 6.3cm/s will cause the block to come to rest, as indicated by the dotted line. 


negative if, 


Mb 


( 101 ) 


in which case the contact will be broken no matter how slow the block is moving - this is exactly what Lemma 10 
predicts. The system will exhibit Zeno behavior requiring infinite transitions in finite time as each impact removes 
some energy but does not immobilize the block, as can be seen in the upper row of Figure 9 which plots the vertical 
velocity as the system undergoes a Zeno execution. 

Instead if the pseudo-impulse is considered. 


U,(P + P) 


z{2Ih + mb{w^^ 5imbg 
0 ' ^ 


the contact will be broken if. 


, 2 ^ 2 I 

h > w H-- 

mt 


5,gw^ 


■U,(P + P)<0, 


( 102 ) 


(103) 


where as the speed goes to zero (z 0) the threshold on height that will allow the contact to persist will grow and 
eventually be met - this is exactly the case considered in Theorem 3. This truncation of the Zeno execution can be 
seen in the lower row of Figure 9, where for the dimensions used the block will come to rest if the vertical speed at 
impact is less than (i.'icmjs. 

Finally, note that the (somewhat restrictive) result in Theorem 11 applies exactly to this rocking block example. The 
word associated with the Zeno execution of interest alternates between the left and right constraints being active, i.e.. 




(104) 


Since the (gravitational) force violates (99), any value of the pseudo-impulse parameter 5, > 0 will prevent an infinite 
number of liftoff (and hence touchdown) transitions for either constraint. We conclude in this case that inclusion 
of the pseudo-impulse has the effect of truncating the Zeno execution, i.e. preventing an infinite number of discrete 
transitions in finite time, as explored further in the next section. 


4.4 Zeno convergence results 

It is instructive to contrast Theorems 10 and 11 and Corollary 1 with the completion results in Or and Ames (201 1). 
For Lagrangian systems subject to plastic impact with a single unilateral constraint, the completion of the simple 
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Lagrangian hybrid system in Or and Ames (2011) coincides with the self-manipulation hybrid system we develop 
in Section 3.2. Specifically, the completion is a hybrid dynamical system (in the sense of Definition 2 with one 
constrained and one unconstrained mode. Transition from the unconstrained to the constrained mode occurs at impact; 
outward-trending forces trigger the transition back to the unconstrained mode. Our self-manipulation system can 
therefore be viewed as a generalization of the completion to Lagrangian systems undergoing plastic impact with an 
arbitrary number of unilateral constraints, a situation not considered in Or and Ames (2011). In connection with 
the numerical simulation literature discussed in Section 1.2.1, we note that (Stewart 1998, Lem. 12) provides an 
accumulation result for time-stepping algorithms that is analogous to our Corollary 1 . 

We further clarify the relationship between our contributions and the results in Or and Ames (2011) in the case 
of purely inelastic (i.e., plastic) impact. Although the (un-completed) simple Lagrangian hybrid system allows plastic 
impacts (i.e., a coefficient of restitution e = 0 in (Or and Ames 2011, Eqn. 4)), the definition of the guard in (Or and 
Ames 2011, Sec. II-A-3) implies that every plastic impact is a Zeno event - every (q,q) G TQ for which a(q) = 0 
and A(q)q = 0 is a fixed point of the reset map in (Or and Ames 2011, Eqn. 4). This stands in contrast to our guard 
definition (76)-(77), where we have excised such points from the domains of the reset maps. In plain language, we 
ensure that constraints may persist after an impact without instantaneously triggering Zeno events. As an illustration, 
consider just a single impact event in the rocking block of Eigure 8. In the simple Lagrangian hybrid system of Or and 
Ames (201 1), plastic impact at time f € R results in a Zeno execution over a hybrid time trajectory T =IJ^i {f} that 
spans zero (continuous) time, thus conflicting with Theorem 8 (before, possibly, completion and continuing exection 
to the Zeno execution considered in Section 4.3). In our self-manipulation hybrid system, the execution continues past 
this impact as illustrated in Eigure 9 (top) by transitioning to a constrained mode. We note that the behavior of our 
system (and, equivalently, the completion from Or and Ames (2011)) is consistent with the analysis of the rocking 
block in (Housner 1963, Sec. 2). 

We also comment on the relationship between the truncation effect introduced by our pseudo-impulse and the 
reliable truncation proposed in (Or and Ames 2011, Def. 6). The pseudo-impulse we proposed in Section 2.7 prevents 
an infinite number of isolated liftoffs in finite time from pairs of constraints impinged upon by the external forces; 
this is the content of Theorem 11 . In (Or and Ames 2011, Def. 6), reliable truncation conditions were shown to yield 
simulated executions that approximate a Zeno execution to specified precision; this is the content of (Or and Ames 
2011, Thm. 3). Thus our contribution is a phenomenological heuristic that augments the hybrid system to prevent some 
Zeno executions from arising (specifically, those Zeno executions that only involve two constraints impinged upon by 
external forces). The contribution in (Or and Ames 2011, Sec. V) is a formal guarantee of simulation accuracy for 
Zeno executions in the original hybrid system. We have yet to determine the “reliability” of our truncation in this sense 
(though, as noted in the discussion following Theorem 11 , the psuedo-impulse truncation is reliable in this sense for 
the rocking block). 

It is possible to relax the hypotheses in Theorem 10 in several ways that ensure the results in Section 3.5.1 still 
hold. It is straightforward to allow time-dependent forcing (as in (Ballard 2000, Thm. 10)) so long as the applied and 
potential forces obey the estimate, 

V (q,q) GTQ: |T(f,q,q) -N(f,q,q)|j^-i < i{t){l + |q|M+ %(qo,q)), 

where £ : R —M is nonnegative and locally integrable. Eully-actuated massless limbs can be included by constraining 
their motion with respect to the body degrees-of-freedom, e.g. through the use of “mirror laws” (Buehler et al. 1994), 
so long as the forces required to enforce the desired motion obeys the estimate in (4.4). Care must be taken to allow 
the forcing to depend on the contact mode, since it is possible to introduce “sliding modes” wherein limbs cycle 
infinitely often between constrained and unconstrained modes at a single time instant; we discuss this issue further in 
Section 4.5. 

Einally, as we have not yet been able to construct an example wherein a constraint that meets (99) is involved in 
a Zeno execution, we speculate that the pseudo-impulse truncates a larger class of Zeno executions than handled by 
Theorem 11. Such an extension would require a careful treatment of the interaction between the complementarity 
conditions and the Zeno execution, resulting in either a proof that (99) can not hold for a constraint undergoing Zeno 
or conditions for Zeno executions involving more than two contact constraints. 

4.5 Contact-dependent forcing 

The developments in Sections 2 and 3 allow, in principle, for the applied forces Y/ to depend on the set of active 
constraints I G J. This is a desirable feature of our formalism since many extant robots sense their contact state with 


38 


t=0.0 




t=0.15 




t=0.3 


t=0.05 




t=0.2 




t=0.35 


t=0.1 



t=0.25 



IsJ 


Scale; — lOcm — lOOA^ — IQQcm/s 


Figure 10: Keyframes from RHex simulation leaping onto a 73cm ledge. Blue arrows show contact forces while the 
red arrow shows body velocity. The coefficient of friction is ji — 0.8 and the relative leg timing is f 2 = 0.06s. 


the world and accordingly alter their actuator commands. This also enables the separate handling of massless limbs 
that make or break contact with the ground. Assumptions A5 and A6. The hybrid system formalism provides a direct 
route to incorporate this sort of feedback. Indeed, so long as one can ensure that the complementarity assumptions 
hold, A9 and AlO, then the self-manipulation system (Def. 5) has disjoint guards (Thm. 5) and hence is deterministic 
(Thm. 6) and non-blocking (Thm. 7). Note, however, that the complementarity problems do not depend on contact 
mode, and so care must be used to enable contact-dependent forcing that does not break these assumptions, otherwise 
the execution may alternate between two adjacent modes. 


5 Conclusion 

The hybrid system model presented here provides for the consistent inclusion of many common simplifying physi¬ 
cal assumptions, including rigid bodies and plastic impacts, as well as some less common assumptions, such as the 
pseudo-impulse. These assumptions are well understood to be only approximations to the real physics: our central 
contribution is to develop sufficiently compatible refinements of previously investigated versions as to obviate their 
erstwhile conflicts. Nevertheless, this refined model is still able to capture qualitatively many behaviors of interest in 
robotics - not merely the familiar steady state tasks (e.g., Buehler et al. (1994), Holmes et al. (2006)) but also tran¬ 
sitional maneuvers such as, archetypally, the leap onto a ledge shown in Figure 10 (a behavior first demonstrated in 
Johnson and Koditschek (2013b)). Simulation results such as these suggest the descriptive power of our refined col¬ 
lection of physical assumptions, while the consistency properties of Section 3.4 ensure that they avoid these potential 
conflicts. However, as noted in the text, there remain a few cases where the formal proofs included here are limited 
to a still further constrained subset of mechanical settings than admitted by these assumptions, most notably Theo¬ 
rems 3, 8, 10, (& 11. As noted at several points throughout the text, we believe that the conclusions remain true under 
the broader conditions (i.e., those listed in the assumptions themselves), but more general proofs of these properties 
remain an open research question. 

Including such explicit assumptions makes it clear that certain extensions of this model satisfying these assump¬ 
tions are trivially admissible, such as more complicated rolling contact conditions, (Murray et al. 1994, Sec. 5.2.1), 
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over rough (though still semi-analytic) terrain. At the same time extensions that violate these assumptions will require 
that some of the formal proofs be reconsidered, for example elastic impacts. One assumption that is often relaxed is 
the persistence of contact. Assumption A4, which precludes the use of time-stepping formulations (e.g., presented in 
Anitescu and Potra (1997), Stewart and Trinkle (1996)) that have gained in popularity as a modeling and simulation 
framework. The remaining assumptions do not explicitly depend on Assumption A4, and so it may be possible to 
extend some of the results from Section 2 to these settings (although the massless leg conditions. Assumptions A5 
& A6, and the pseudo-impulse assumption. Assumption All, may prove challenging to maintain). 

Indexing the contact mode as a subset of the possible contact constraints suggests a natural simplicial topology 
(Hatcher 2002) over these contact modes, as first suggested in Johnson and Koditschek (2013b). This organization 
of the hybrid system should enable the inspection of structural properties of the system as a whole. Furthermore the 
various guard sets imply a refinement of the domains into disjoint sets that reach a unique next guard (or remain in that 
mode forever). More broadly, we believe that this physically motivated hybrid system definition, formal consistency 
now established, invites study as a mathematical object whose properties may likely yield formal insights into the 
nature of these mechanical systems and promote the design of more complex robot behaviors that can exploit them. 
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Appendix 

A Proof of Lemma 2 

Proof. First consider {h{x) -<f 0) {g{x)h{x) -<f 0). Let m/, be the index of the first nonzero derivative in the 
definition of h{x) -<f 0, (3), and so for all £ < m/, the f'* Lie derivative is zero, {Lph)(x) = 0. Therefore we also have, 

■ h)) W = L = L i^P''g{x)) ■ (0) = 0, (105) 

and similarly, 

{Cp''g ■ h){x) = ^ j {Cp''^'‘g{x)) ■ (Cphix)) = g{x) ■ {Cp''h{x)), (106) 

where since g{x) > 0, g{x) ■ {C'p'h{x)) < 0 {£p''h{x)) < 0. Therefore {h{x) -<f 0) {g{x)h{x) -<p 0). 

Now consider {g{x)h{x) -<f 0) ^ {hix) -<f 0). Let nigh be the index of the first nonzero derivative in the definition 
of g(x) ■ h{x) ^p 0, (3). The proof proceeds by strong induction on i, where 0 < i< nigh, relative to the proposition, 

[Cpg • h) (x) = g{x)Cph{x), (107) 

i.e. that the Lie derivative of the product is equal to the product of positive function, g{x), with the Lie derivative 
of the other factor, h{x). The base case is trivial, as for the 0'^’ derivative, 

{^Fg-h){x) = g{x)Cph{x) = g{x)h{x). (108) 

If nigh = 0, then g(x) ■ h(x) < 0, but since g(x) > 0 we get that h(x) < 0 and therefore h(x) -< 0. If instead nig/, > 0, 
then g(x) ■ h{x) = 0 and therefore h{x) = 0. For the inductive step, suppose that the statement is true for all k < £, 
implying that, 

{Cpg ■ h) (x) = g{x)Cph{x) = 0, (109) 

(as recall that £ < nig/,), and therefore, Cpli{x) = 0. Thus, 

{C‘p{g-h))ix) = Y, (Jj (^F^^g(x)) ■ {^fKx)) =g{x)-{Cph{x)), (110) 
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and the induction complete, we now conclude that the proposition, (107), holds for all I < 

If g{x)li{x) -<F 0, then by (3) for all £ < {^C^p{g-h)(x)) = 0, and so using(107) andg(;ii:) > 0, we can conclude 

that Cpli{x) = 0. Similarly for £ = mg/,, • h){x)) < 0 Cp^''h{x) < 0. Taken together, these are exactly the 

conditions for h{x) -<f 0, (3), and so {g{x)h{x) -<f 0) {h{x) -<f 0). 

□ 


B Linear Algebra 


For additional notes on the Schur complement and block matrix inverse, see e.g. Cottle (1974), Lu and Shiou (2002), 
or Jo et al. (2004). Consider a block matrix M defined as. 


M;= 


E F 
G H 


If E is nonsingular, then the Schur complement of £ in M is, 

Se ■.= h-ge-'^f, 


which is sometimes written as {M\E). 

If M is also nonsingular, the inverse of M is. 


■ E 

F ' 

-1 

■ E-'- +E-'^FSp'^GE-'^ 

-E-'-FSp^ ' 

G 

H 


-Sp^GE-'^ 

0-1 

J 


( 111 ) 


( 112 ) 


(113) 


In particular when M is invertible the block matrix inverse of (8) can be written as. 


M) 

1 

- 1 - 

< 


■ M 

A^ 1 

-1 

M ‘-M ‘a^(AM ‘a^)“‘AM ‘ 

M 'a^{AM ‘a^)-‘ 


Ay 


. Ay 

Oyxy 


(am“'a^)-‘am“' 

-(am“‘a^)-' 


(114) 


Where when M is positive definite, so is (AM and therefore A is negative definite. Similarly, when M is 

only positive semi-definite, A will be negative semi-definite. 

A common refinement to this inverse that will come up when considering some constraint sets J and K such that 
K = JG{k} is22. 


^K = 


Ay 

Air 


(115) 


M 

Aif 


^KxK 


1 


■ M AJ ■ 


r 1 


= 


Ay Oyxy 


Oyxi 




A*. Oixy 

0 



(116) 


m] a7' 
a] Ay 


Icl+ 






mJ a7' 

A) Ay 


Ay 


m] 

1 = 

Uo' 

1_1 

1-1 

V < 


”m) Af 
a; Ay_ 

+ 

1 1 

S£'[A*Mi Ay-Af] - 

m)a[ 

AiA[_ 

S~^ 


C-1 



-^^‘[AtM 

■ AyAf] 

Se' 



r [mJ+mJa[s£‘a*mJ] [a7+m;a[5£‘a*a7 -m)a[S£> ] 


A;+A]A[S£‘A,My 


-S£‘Ay.M) 


Se '=0— [A;t o] 


m] Af ■ 
a] Ay 


tr 


- 57 ^ At A 


= -a,mIaI. 


V ^ A 7" 


m; Af 

A^ Ak 


(117) 

(118) 


Note that when both the matrix and the first block in (116) are invertible, Se must be non-zero as Sp^ is an element 
of the inverse in (117). Since Ak is negative semi-definite, so are its principle minors, in particular Sp^. Therefore 

AjtMjAg, as a positive semi-definite and non-zero scalar, is a positive number. This final expansion, (117), expresses 
the components of A^,M^, and Ak in terms of Aj,Mj, and Ay together with the added constraint A^.. 

Note that the Schur complement, Se, used here is with respect to the blocks used in (116) as defined explicitly in (118). 
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B.l Proof of Lemma 4 


Proof. Define some set of generalized coordinates (as in (Johnson and Koditschek 2013a, Sec. II.G)), y, such that 
y = Yq and that the Jacobian of the corresponding implicit function is defined so that q = Hy, 


H = 


A 

-1 

0 

Y 


w 


(119) 


For this proof we need to show that M = H^MH is invertible if and only if is. The Rank Theorem (Lee 

2012, Thm. 4.12) implies there exists a parameterization such that the constraint can be decoupled into a full rank c x c 
subblock, A = [B Ocxe], and therefore we choose a parameterization such that, Y = [Ogxe We]. Thus, 


B 

^cxe 

-1 

0 


B ' 

Ocxe 

0 


0 

^exc 

1 


Ide 


^exc 

Id, 

idg 


Ide 


( 120 ) 


M = H^MH = 


0 We 


■ Mil 

M12 

0 

M21 

M22 

Me 


= M22, 


and so the requirement is that M is invertible reduces down to simply requiring that M 22 is invertible. 
On the other hand we have. 


■ M 

A^ ■ 


■ Mu 

Mi2 

B^ 

A 

0 

= 

M21 

M22 

0 




B 

0 

0 


( 121 ) 


( 122 ) 


Since B is full rank then the invertability of this matrix again reduces to simply invertibility of M 22 (e.g., (Lu and 
Shiou 2002, Corollary 3.3)), and thus the conditions are equivalent. □ 


B.2 Proof of Lemma 5 


Proof. Recall that limE^oMr = M and that Mg is invertible for all e € (0,e), for some e > 0. For all e > 0, define 
Mg, A|, and Ag by replacing M with Mg in (8). Using (14)-(16) we can rewrite the dynamics, (12)-(13), 


A = A^ (Y — Cq — N) — AAq = lirn A^ (Y — Cq — N) — AgAq 
= liin ((AM;‘A^)-'AMg‘) (Y-Cq-N) + (AMg‘A^)-'Aq 
= lim(AMg‘ a^)-‘ (AMg ‘ (Y-Cq-N) +Aq) , 
q = M^ (Y - Cq - N) - A^^Aq = lim Mg (Y - Cq - N) - Af Aq 
= liin (M;'-M;‘A^(AM;‘A^)-'AM;')(Y-Cq-N)-M;‘A^(AM;‘A^)-'Aq 
= lim Mg ‘ (y - Cq - N - A^ ((AM^^ A^)-* (AM^^ (Y - Cq - N) + Aq))) 

= liin Mg ‘ (Y - Cq - N - A^A) , 


(123) 

(124) 

(125) 

(126) 

(127) 

(128) 
(129) 


where (125) and (129) are identically equal to the desired formulation of (17) and (18) when Mq is non-singular. □ 


B.3 Proof of Lemma 8 

Proof. Recall that limg^o Mg = M and that Mg is invertible for all e G (0, e), for some £ > 0. For all £ > 0, define Mg, 
A|, and Ag by replacing M with Mg in (8). Then using equation (15) we can rewrite the impulse (where all constraints 
A are taken to be for the target contact mode 7), 

P;l = -AAq- = lim -AgAq- = lim(AMg‘A^)-^Aq-, (130) 

£->0 £->0 

which is identically equal to (26) when Mq is non-singular. □ 
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C Differential Topology 

Let M be a C'^ manifold with boundary where r G N U { 0 °, 0)}. There are several natural constructions associated with 
M we invoke repeatedly, so we briefly introduce them here and refer the reader to Lee (2012) for formal definitions. 
At every point x G M there is an associated tangent space TxM, which is a vector space with the same dimension as 
M, i.e. dimTxM = dimM; if M is a submanifold in a Euclidean space of suitable dimension, TxM may be regarded as 
a hypersurface in the ambient Euclidean space. Collating these tangent spaces yields the tangent bundle TM = WxeM 
TxM, which is naturally a C'' manifold with boundary whose dimension is twice that of M, i.e. dimTM = 2 dimM. 
There is a canonical projection n \ TM ^ M that simply “forgets” the tangent vector portion of a point (x, v') GTM, 
i.e. n (x, v) = x. At every point x G M there is an associated cotangent space T*M, which is the dual of the vector space 
TxM (i.e. every v G T*M is a linear operator v : TxM R). Collating these cotangent spaces yields the cotangent 
bundle T*M =IXrGM T*M, which is naturally a C'^ manifold with boundary whose dimension is twice that of M, i.e. 
dim7’*M = 2 dimM. A C'" map F :M ^TM is called a vector field iinoF = \Am where n: TM ^ Mis the canonical 
projection and Id ; M —M is the canonical identity function. Given a C* map f : M ^ N between C'^ manifolds, 
there is an associated pushforward map Df : TM ^ TN between their tangent spaces that evaluates to a linear map 
Df{x) : TxM Tf^x)N at every x G M; in coordinates, Df is the familiar Jacobian derivative of /. The rank of / at 
X G M is defined to be the rank of the linear operator Df(x)\ if the rank of / does not vary over M, it is called constant 
rank. 

By the Whitney Embedding Theorem (Lee 2012, Thm. 6.15) (if r G NU {°°}) or the Nash Embedding Theorem 
(Nash 1966) (if r = co), M admits a C'^ embedding i : M ^ thus any C manifold may be regarded as a 

submanifold of a Euclidean space of suitably high dimension. Since F is C'^, the pushforward DioF admits a C'^ 
extension F : M ^ TM over an open neighborhood M C of the embedded image of M. The Eundamental 

Theorem on Elows (Lee 2012, Thm. 9.12) implies there exists a maximal flow <I> G C''{0,M) for F where (9 C R x M 
is the maximal flow domain. We may restrict <I> to obtain a flow overM as follows. Eor each x G M, let, 

fljc = inf {f < 0 I (f, i(x)) G (9 A V i G (f,0] ; <I>(i,l(x)) G i(M)}, 
bx = sup{f > 0 I (f,l(x)) G (9 A Vi G [0,f) : 4>(i,l(x)) G l(M)}. 

Let C R be the interval between ax and bx, including the endpoint if the corresponding infimum or supremum 
is achieved. Then let O = C R x M and, noting that <I>(f,i(x)) G i(M) if (f,x) G (9, define the flow 

<I>; (9 —> M by <I>(f,x) = i(x))). Note that <t> is C'^ in the sense that i oO admits a C' extension, <I>. 

Eor any G C dM, let, 

// = {xGM|3f>0: (f,x) G (9A<I>(f,x) G G}. 

Define 77 ; R by, 

Vx G // : T 7 (x) = infjf > 0 I (f,x) G (9 A<I>(f,x) G G}, 

and 17 /: // —7 G by y/jx) = 'I>(77(x),x) for all xG H. Letting H = {x G H : F{\i/{x)) ^ 7’^(j.) 5M}, it is clear that J ?|/7 G 
C'^(//,R). Note that 77 is not differentiable at any point x G H for which F{\if{x)) G 7’^(j.)(9M; changing coordinates to 
a flowbox makes this obvious. Intuitively, the impact time has unbounded sensitivity to initial conditions near such a 
point of tangency. 

D Hybrid Differential Topology 

In mechanical systems undergoing intermittent contact with the environment (i.e., terrain or objects), the dynamics 
are “piecewise-defined” (or hybrid): whenever a limb attaches or detaches from the substrate there is an instanta¬ 
neous change in the set of active constraints, leading in general to a discontinuous change in velocity and (constraint) 
force. Though it is possible to analyze these discontinuous dynamics in the ambient tangent bundle as in Ballard 
( 2000 ), introducing a distinct portion of state space associated with every contact mode renders both the continuous¬ 
time dynamics (given by the flow of a vector field) and discrete-time dynamics (specified by a reset map) smooth. 
Thus although additional notational overhead is required to index the constituent dynamical elements, the extra ef¬ 
fort is partially compensated by enabling the use of elementary constructions from differential topology (rather than 
sophisticated measure-theoretic techniques used in Ballard (2000)). 
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Motivated largely by these observations, Burden, Revzen and Sastry (2015) proposed to define hybrid dynamical 
systems over a finite disjoint union, 

M =\ljej Mj=\J {7} xMj = {(7,x) :J€j,x€ Mj }, 

where Mj is a finite dimensional C'^ manifold (possibly with corners) for each 7 € JT. By endowing M with the unique 
largest topology with respect to which the (canonical) inclusions Mj ^ M aie continuous (Lee 2012, Prop. A.25), 
the set M becomes a second-countable, Hausdorjf topological space which is locally Euclidean in the sense that each 
point X € M has a neighborhood that is homeomorphic to an open subset of M"' , some n^ € N. Since the dimension 
is no longer required to be fixed, M is technically not a topological manifold (Lee 2012, Chapter 1). However, it is a 
mild generalization"^, hence we refer to it as a hybrid topological manifold. 

Motivated by the self-manipulation system from Section 3, we extend the definition in Burden, Revzen and Sastry 
(2015) to allow the component manifolds Mj to possess corners. Unfortunately there is not presently a consensus on 
what ought to be the definition of a manifold with corners (Joyce 2012, Remark 2.11). Fortunately, for our purposes the 
most straightforward definition in (Lee 2012, Ch. 16) suffices. This variant, for instance, ensures smooth extensibility 
of maps between manifolds with corners; see the bottom paragraph of (Lee 2012, p. 27). (Note that the discussion of 
manifolds with boundary in (Hirsch 1976, Sec. 1.4) (termed (9-manifolds) does not address this, though (Hirsch 1976, 
Lem. 3.1 in Sec. 2.3) should make it unsurprising.) This coincides with (Joyce 2012, Def. 1, 2). 

For each 7 G Mj has an associated maximal C'' atlas Aj. We construct a maximal C'' hybrid atlas for M by 
collecting charts from the atlases on the components of M: 

-4 = {{{J} xU,(po7tj) :J G J, {U, (p) G Aj} , 

where Kj : {7} x Mj Mj is the canonical projection. We refer to the pair {M,A) as a C' hybrid manifold, but may 
suppress the atlas when it is clear from context. We define the hybrid tangent bundle as the disjoint union of the 
component tangent bundles, 

and the hybrid boundary as the disjoint union of the boundaries. 


dM =Wjej dMj. 

Let M =[Xj(zj Mj and N =[1lcC Nl be two hybrid manifolds. Note that if a map R:M N is continuous as a map 
between topological spaces, then for each J G J there exists LG C such that R{Mj) C Nl and hence R\mj ■ Mj N^. 
Using this observation, we define differentiability for continuous maps between hybrid manifolds. Namely, a map 
R:M ^ N is called C'^ if R is continuous and R\mj ■ Mj N isC for each J G J^. In this case the hybrid pushforward 
DR ; TM ^ TN is the C' map defined piecewise as DR\tmj = D{R\mj) for each J G J. AU map F : M ^ TM is 
called a hybrid vector field if noF = Wm where n : TM ^ M is the canonical projection and Id ; M M is the 
canonical identity function. 

E Proofs supporting Theorem 10 

E.l Proof that velocity is bounded 

The following are standard results used in the proof of Theorem 10 to prove that velocity is bounded. For convenience, 
we transcribe and apply to our setting these statements from Ballard (2000), which applies them to achieve a similar 
aim, however this is not to imply that Ballard (2000) is necessarily the original source of these results. 

Lemma 11 (Prop. 7 in Ballard (2000)). 7/'(q,q^): [ft) -gTQIs a right-continuous trajectory of a Lagrangian system 
subject to perfect unilateral constraints with efforts map f := Y — N, then for all t G [ffi): 

< [ tiqis),q^{s))q^{s)ds. 

M 

Since, crucially, each of the distinct finite components Mj is a conventional smooth C manifold (of necessarily fixed dimension). 
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Lemma 12 (Lem. 17 in Ballard (2000)). Let a : [f,f] —R be integrable and nonnegative for almost all t G (ft)- If 
0 • [L^] R bas bounded variation and. 


for some a >0 then. 


Vf G \t_f] ; 10(01 < a + 



Lemma 13 (Lem. 15 in Ballard (2000); Prop. 3.21 in Sastry (1999)). Let ai : \t_,t] R have bounded variation and 
a 2 : [ft] —>■ R integrable and nonnegative for almost all t G {fl). /f 0 : [f, F] —> R has bounded variation and. 


V t G [ft] : 0(0 < ai{t) + a 2 {s)^{s)ds, 


then, 

Vf G [f,F] : 0(f) < ai{t) + fli(s)a 2 (s)exp a 2 {<y)da^ ds. 

We apply the Lemmas above as in the proof of (Ballard 2000, Thm. 10) to establish that the velocity is bounded 
on finite time horizons. Let (q^,q) ; [lO ^2 ^e a right-continuous trajectory of a Lagrangian system subject to 
perfect unilateral constraints and with forces that satisfies the bound in (19). Lemma 11 yields for all f G L := [f ,f): 


1 ~+ 2 
X q (0 


M 


q^(0 < / f(q(0,q^(0)q^(0'^^- 

M Jt 


(131) 


Applying Lemma 12 with 0(f) = q (f) 
for t GT: 


_, « = 


M 


q (0 anda(s)= f(q(0,q (0) 

M 


M 


j combined with (19) implies 


q^(0 

< 

M 

q^(0 

_+/ 
M Jt 


< 

q^(0 

_+/ 

M Jt 


Recalling that %(q(f),q(0) < // q^(^) 


C 

ds we find. 


1 -f 


q (0 _+%(q(0:q(0) 


(132) 


ds. 


M 


'^M(q((.)7q(0) + q q iO ,^+c{t-t_)+(i+c) q (s) 


j^+'^M(q(c),q(0) 


ds. 


(133) 


Applying Lemma 13 with 0(f) = ii^(q(0,q(f)) + q (f)_,ai(s)= q (0 _ +C(f— 0. fl 2 (s) = 1 +C yields. 


M 


M 


'^M(q(0)q(0)+ q^(0 —< «i(0+ / ai(i)fl2(^)exp [ a2{cy)dc 

M Jt Js 


ds. 


(134) 


In particular, since the right-hand-side of the inequality is bounded on finite time horizons, velocity is also bounded 
on finite time horizons. 


V := sup 

teT 




<C 


(135) 


E.2 Integration-by-Parts 

Suppose (q,q) : [f,t]^ TQ satisfies (17). Then left-multiplying by M and rearranging, 

M(q)q + ^q,q)q = T(q,q)-N(q,q)-A^(q)A(q,q), 


where for all f, 7 G { 1 ,..., q}, 

C,7(q,q) 


qi/ aM,7(q) aM,,(q) aM,7(q) \ , 

^= 12 y dq^ dqi dq‘ j 


(136) 
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see (Murray et al. 1994, Eqn. 4.23) or (Johnson and Koditschek 2013a, Eqn. 30) for details. Note that for all i € 


[Mq + Cq]‘ = f [M„V]+ f 

j=i j.k=\ 


1 ( 5M,j(q) 5M,-^(q) 5M,ti(q) \ .k. 


dq^ 




= t [Moq^] + t 

y=i J-k=i 


dqi^ 2 dq' 


;=i 


j.k=i 


1 aM^y(q) . 


dq‘ 


q q 




dq‘ j 


'C'(q,q), 


q q" 


;=i 


where. 




' 14=1 


Therefore rearranging (136) we have for each i G {1,... ,q}. 


[M,yq^] = r-N'-C'-(A^l)'. 


(137) 


./=i 


Integrating both sides of (137) over the time interval reintroducing the dependence on (q,q) and time, and 
vectorizing over the index i, 

M(q(f))q(f) -M(q(f))q(0 = (^T(q(i),q(i)) -N(q(i),q(i)) - C(q(i),q(i)) - A(q(i))^A(q(i),q(i))^ ds, 

as used in (90). 

E.3 Proof that constraint forces and impulses are bounded 

The following is a transcription of the argument used in the proof of (Ballard 2000, Prop. 18) to show that constraint 
forces and impulses are bounded on bounded time intervals. 

Given a right-continuous trajectory (q,q ) ; [f ,f) —TQ of a Lagrangian system subject to perfect unilateral con¬ 
straints, we assume that; the inertia tensor M is nondegenerate; the position tends to a limit q := lim^^jq; and the 

velocity is bounded by v” := sup,gj | q^(0 < °° where T = This ensures there exists a compact neigh- 

LI M J _ 

borhood K cV such that q([f ,f)) C K and hence with B(0, v) C denoting the closed ball of radius v centered at 
the origin, the compact subset K' := B{0,v) x K C TV contains (q,q )([f,f)). This implies the following constants 
are finite: 

'r(q,q)-N^'(q,q; 

^M^,f(q) 


F 

:= max 

max 


MU..., 

q} 

G 

:= max max 


j,k,ee{l 

,...,q} qe^^ 


dqj 


Letting cTmax and Omin denote the maximum and minimum singular values of M over K, we obtain the follow¬ 
ing bounds; 

Y,Mjkiq)q'' 

i:=l 


max max 

(q.q)e/s:' 


^ V <7maxt'j 

V 

max max q-^ < 


"x/ Gfmin 


ye{i,...,q} (q.q)Gtr' 

Suppressing dependence on q and q^, we arrive at the bound that for each j G {1,..., |/C|}, 

q^Gv^ 


M- 


\t)q{t)-M\t^)q{tJ\ + fv 


- N - C^ds < 2x/(JmaxV + F 


2o„ 


(f - fm) < 


thus satisfying the condition on (93). 
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